
SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: MEDICAL RESEARCH COUNCIL 

(B) STREET: 20 PARK CRESCENT 

(C) CITY: LONDON 

(E) COUNTRY: UK 

(F) POSTAL CODE (ZIP): WIN 4AL 

(ii) TITLE OF INVENTION: GENE 

(iii) NUMBER OF SEQUENCES: 16 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 924 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATIONS. .604 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

TGTC ATG TTG CGG GCT TTG AAC CGC CTG GCC GCG CGG CCC GGG GGC CAG 
49 

Met Leu Arg Ala Leu Asn Arg Leu Ala Ala Arg Pro Gly Gly Gin 
15 10 15 

CCC CCA ACC CTG CTC CTT CTG CCC GTG CGC GGC CGC AAG ACC CGC CAC 
97 

Pro Pro Thr Leu Leu Leu Leu Pro Val Arg Gly Arg Lys Thr Arg His 
20 25 30 

GAT CCG CCT GCC AAG TCC AAG GTC GGG CGC GTG AAA ATG CCT CCT GCA 
145 

Asp Pro Pro Ala Lys Ser Lys Val Gly Arg Val Lys Met Pro Pro Ala 
35 40 45 

GTG GAC CCT GCG GAA TTG TTC GTG TTG ACC GAG CGC TAC CGA CAG TAC 
193 

Val Asp Pro Ala Glu Leu Phe Val Leu Thr Glu Arg Tyr Arg Gin Tyr 
50 55 60 

CGG GAG ACG GTG CGC GCT CTC AGG CGA GAG TTC ACA TTG GAG GTG CGA 
241 

Arg Glu Thr Val Arg Ala Leu Arg Arg Glu Phe Thr Leu Glu Val Arg 
65 70 75 

GGG AAA TTG CAC GAG GCC CGA GCC GGG GTT CTG GCT GAG CGC AAG GCG 
289 

Gly Lys Leu His Glu Ala Arg Ala Gly Val Leu Ala Glu Arg Lys Ala 
80 85 90 95 

CAA GAG GCC ATC AGA GAG CAC CAG GAG CTG ATG GCC TGG AAC CGG GAG 
337 

Gin Glu Ala He Arg Glu His Gin Glu Leu Met Ala Trp Asn Arg Glu 
100 105 110 

GAG AAC CGG AGA CTG CAG GAA CTA CGG ATA GCT AGG TTG CAG CTC GAA 
385 

Glu Asn Arg Arg Leu Gin Glu Leu Arg He Ala Arg Leu Gin Leu Glu 
115 120 125 

GCA CAG GCC CAG GAG CTG CGG CAG GCT GAG GTC CAG GCC CAG AGG GCC 




433 

Ala Gin Ala Gin Glu Leu Arg Gin Ala Glu Val Gin Ala Gin Arg Ala 
130 135 140 

CAG GAG GAG CAG GCT TGG GTG CAA CTG AAA GAA CAA GAA GTT CTC AAA 
481 

Gin Glu Glu Gin Ala Trp Val Gin Leu Lys Glu Gin Glu Val Leu Lys 
145 150 155 

CTG CAG GAG GAG GCC AAA AAC TTC ATC ACT CGG GAG AAC CTG GAG GCA 
529 

Leu Gin Glu Glu Ala Lys Asn Phe lie Thr Arg Glu Asn Leu Glu Ala 
160 165 170 175 

CGG ATA GAA GAG GCC TTG GAC TCT CCG AAG AGT TAT AAC TGG GCG GTC 
577 

Arg lie Glu Glu Ala Leu Asp Ser Pro Lys Ser Tyr Asn Trp Ala Val 
180 185 190 

ACC AAA GAA GGG CAG GTG GTC AGG AAC TGAGAACAGA GGCCTCTCAG 
624 

Thr Lys Glu Gly Gin Val Val Arg Asn 
195 200 

GCCCAAATAA GGACAGTGCT TGCCTAGGGA CTGGATATTG GGGTAGAAAT 
TGGTGCATCC 684 

CAGGAGGGTG GCACAGCCTT GTCCAGAGCA GCCCCCATTC ATTCTAGATT 
TGGCACCAGG 744 

TATAGTACCT GTTCTGACAC CACATACAAA CTCCGGACAG CATTAAACTC 
TGGGAAGTTC 804 

CTATCACACA GAAGATCAGA CTGGACTGTC CCCTCTAGAA GCCAAGAGCT 
GTCTCCTGAG 864 

TTTCTTGGAA TAGTGTGAGC CCAATGTTTC CTGCTTTTAT AAATAAACTA 
TTGGAAAGCA 924 



(2) INFORMATION FOR SEQ ID NO: 2: 




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 200 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Leu Arg Ala Leu Asn Arg Leu Ala Ala Arg Pro Gly Gly Gin Pro 
15 10 15 

Pro Thr Leu Leu Leu Leu Pro Val Arg Gly Arg Lys Thr Arg His Asp 
20 25 30 

Pro Pro Ala Lys Ser Lys Val Gly Arg Val Lys Met Pro Pro Ala Val 
35 40 45 

Asp Pro Ala Glu Leu Phe Val Leu Thr Glu Arg Tyr Arg Gin Tyr Arg 
50 55 60 

Glu Thr Val Arg Ala Leu Arg Arg Glu Phe Thr Leu Glu Val Arg Gly 
65 70 75 80 

Lys Leu His Glu Ala Arg Ala Gly Val Leu Ala Glu Arg Lys Ala Gin 
85 90 95 

Glu Ala He Arg Glu His Gin Glu Leu Met Ala Trp Asn Arg Glu Glu 
100 105 HO 

Asn Arg Arg Leu Gin Glu Leu Arg He Ala Arg Leu Gin Leu Glu Ala 
115 120 125 

Gin Ala Gin Glu Leu Arg Gin Ala Glu Val Gin Ala Gin Arg Ala Gin 
130 135 140 

Glu Glu Gin Ala Trp Val Gin Leu Lys Glu Gin Glu Val Leu Lys Leu 
145 150 155 160 

Gin Glu Glu Ala Lys Asn Phe He Thr Arg Glu Asn Leu Glu Ala Arg 
165 170 175 

He Glu Glu Ala Leu Asp Ser Pro Lys Ser Tyr Asn Trp Ala Val Thr 
180 185 190 



Lys Glu Gly Gin Val Val Arg Asn 
195 200 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 998 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION:1..615 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATG CTA CGC GCG CTG AGC CGC CTG GGC GCG GGG ACC CCG TGC AGG CCC 
48 

Met Leu Arg Ala Leu Ser Arg Leu Gly Ala Gly Thr Pro Cys Arg Pro 
205 210 215 

CGG GCC CCT CTG GTG CTG CCA GCG CGC GGC CGC AAG ACC CGC CAC GAC 
96 

Arg Ala Pro Leu Val Leu Pro Ala Arg Gly Arg Lys Thr Arg His Asp 
220 225 230 

CCG CTG GCC AAA TCC AAG ATC GAG CGA GTG AAC ATG CCG CCC GCG GTG 
144 

Pro Leu Ala Lys Ser Lys He Glu Arg Val Asn Met Pro Pro Ala Val 
235 240 245 

GAC CCT GCG GAG TTC TTC GTG CTG ATG GAG CGT TAC CAG CAC TAC CGC 
192 



6 



Asp Pro Ala Glu Phe Phe Val Leu Met Glu Arg Tyr Gin His Tyr Arg 
250 255 260 

CAG ACC GTG CGC GCC CTC AGG ATG GAG TTC GTG TCC GAG GTG CAG AGG 
240 

Gin Thr Val Arg Ala Leu Arg Met Glu Phe Val Ser Glu Val Gin Arg 
265 270 275 280 

AAG GTG CAC GAG GCC CGA GCC GGG GTT CTG GCG GAG CGC AAG GCC CTG 
288 

Lys Val His Glu Ala Arg Ala Gly Val Leu Ala Glu Arg Lys Ala Leu 
285 290 295 

AAG GAC GCC GCC GAG CAC CGC GAG CTG ATG GCC TGG AAC CAG GCG GAG 
336 

Lys Asp Ala Ala Glu His Arg Glu Leu Met Ala Trp Asn Gin Ala Glu 
300 305 310 

AAC CGG CGG CTG CAC GAG CTG CGG ATA GCG AGG CTG CGG CAG GAG GAG 
384 

Asn Arg Arg Leu His Glu Leu Arg He Ala Arg Leu Arg Gin Glu Glu 
315 320 325 

CGG GAG CAG GAG CAG CGG CAG GCG TTG GAG CAG GCC CGC AAG GCC GAA 
432 

Arg Glu Gin Glu Gin Arg Gin Ala Leu Glu Gin Ala Arg Lys Ala Glu 

330 335 340 

GAG GTG CAG GCC TGG GCG CAG CGC AAG GAG CGG GAA GTG CTG CAG CTG 
480 

Glu Val Gin Ala Trp Ala Gin Arg Lys Glu Arg Glu Val Leu Gin Leu 
345 350 355 360 

CAG GAA GAG GTG AAA AAC TTC ATC ACC CGA GAG AAC CTG GAG GCA CGG 
528 

Gin Glu Glu Val Lys Asn Phe He Thr Arg Glu Asn Leu Glu Ala Arg 
365 370 375 

GTG GAA GCA GCA TTG GAC TCC CGG AAG AAC TAC AAC TGG GCC ATC ACC 
576 

Val Glu Ala Ala Leu Asp Ser Arg Lys Asn Tyr Asn Trp Ala He Thr 
380 385 390 
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AGA GAG GGG CTG GTG GTC AGG CCA CAA CGC AGG GAC TCC TAGGGGCCCA 
625 

Arg Glu Gly Leu Val Val Arg Pro Gin Arg Arg Asp Ser 
395 400 405 

GTAAGGACAG TGCCCGCCAG GGACCATGTA TGTATCATGG CGGAAGAGTT 
GGCCCTGACC 685 

TGGAATAAAG CAGTTGGTGT TGCTTATGAG GAAGGTTCAG CCTTATCCAG 
CACAGCCTTC 745 

ACGTTTTGCC CTCTGCTGTC ACCACTTGGT CAGAAACTTC CAAACGCAGT 
GCCCTGTTCT 805 

GCCGGTGTGT AAAGCCTCAG CGCACCAGGA GACCCTAGAG TGGTTTCCAT 
CTCACAGAGA 865 

ATCAGACAGG CCACAGCCCC CTCAGGCAGC CAGGTCATCT GAGTATCATT 
AAGAGTAGTG 925 

ATGGGAAGAT TACAGTCTGA GGGCCAAACG TGCCTGCTTC CTGTTTTTGT 
AAATAAAGTT 985 

TTGTTGGAAC ACA 998 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 205 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Leu Arg Ala Leu Ser Arg Leu Gly Ala Gly Thr Pro Cys Arg Pro 
15 10 15 

Arg Ala Pro Leu Val Leu Pro Ala Arg Gly Arg Lys Thr Arg His Asp 
20 25 30 

Pro Leu Ala Lys Ser Lys lie Glu Arg Val Asn Met Pro Pro Ala Val 
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35 40 45 

Asp Pro Ala Glu Phe Phe Val Leu Met Glu Arg Tyr Gin His Tyr Arg 
50 55 60 

Gin Thr Val Arg Ala Leu Arg Met Glu Phe Val Ser Glu Val Gin Arg 
65 70 75 80 

Lys Val His Glu Ala Arg Ala Gly Val Leu Ala Glu Arg Lys Ala Leu 
85 90 95 

Lys Asp Ala Ala Glu His Arg Glu Leu Met Ala Trp Asn Gin Ala Glu 
100 105 110 

Asn Arg Arg Leu His Glu Leu Arg lie Ala Arg Leu Arg Gin Glu Glu 
115 120 125 

Arg Glu Gin Glu Gin Arg Gin Ala Leu Glu Gin Ala Arg Lys Ala Glu 
130 135 140 

Glu Val Gin Ala Trp Ala Gin Arg Lys Glu Arg Glu Val Leu Gin Leu 
145 150 155 160 

Gin Glu Glu Val Lys Asn Phe He Thr Arg Glu Asn Leu Glu Ala Arg 
165 170 175 

Val Glu Ala Ala Leu Asp Ser Arg Lys Asn Tyr Asn Trp Ala He Thr 
180 185 190 

Arg Glu Gly Leu Val Val Arg Pro Gin Arg Arg Asp Ser 
195 200 205 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 943 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION:5..604 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

TGTC ATG TTG CGC GCT CTG AAC CGC CTG GCG CAG CGG CCG GGA GAC CGG 
49 

Met Leu Arg Ala Leu Asn Arg Leu Ala Gin Arg Pro Gly Asp Arg 
210 215 220 

CCC CCG ACC CCG CTG CTC CTG CCC GTG CGC GGC CGC AAG ACC CGC CAT 
97 

Pro Pro Thr Pro Leu Leu Leu Pro Val Arg Gly Arg Lys Thr Arg His 

225 230 235 

GAC CCG CCT GCC AAA TCC AAG GTC GGA CGG GTG CAG ACG CCT CCC GCC 
145 

Asp Pro Pro Ala Lys Ser Lys Val Gly Arg Val Gin Thr Pro Pro Ala 
240 245 250 

GTG GAC CCT GCG GAA TTC TTC GTG TTG ACC GAG CGC TAC GGA CAG TAC 
193 

Val Asp Pro Ala Glu Phe Phe Val Leu Thr Glu Arg Tyr Gly Gin Tyr 
255 260 265 

CGG GAG ACC GTG CGC GCT CTC AGG CTA GAG TTC ACG TTG GAT GTG CGA 
241 

Arg Glu Thr Val Arg Ala Leu Arg Leu Glu Phe Thr Leu Asp Val Arg 
270 275 280 

AGG AAA TTG CAC GAG GCC CGA GCC GGG GTT CTG GCC GAG CGC AAG GCG 
289 

Arg Lys Leu His Glu Ala Arg Ala Gly Val Leu Ala Glu Arg Lys Ala 
285 290 295 300 

CAG CAG GCC ATC ACG GAG CAC CGG GAG CTG ATG GCC TGG AAC CGG GAC 
337 

Gin Gin Ala He Thr Glu His Arg Glu Leu Met Ala Trp Asn Arg Asp 




305 310 315 

GAG AAC CGG CGA ATG CAG GAG CTA CGG ATA GCG AGG TTG CAG CTG GAA 
385 

Glu Asn Arg Arg Met Gin Glu Leu Arg lie Ala Arg Leu Gin Leu Glu 
320 325 330 

GCA CAG GCC CAG GAG GTG CAG AAG GCT GAG GCC CAG CGC CAG AGG GCT 
433 

Ala Gin Ala Gin Glu Val Gin Lys Ala Glu Ala Gin Arg Gin Arg Ala 
335 340 345 

CAG GAG GAG CAG GCT TGG GTG CAA CTG AAA GAG CAA GAA GTG CTC AAG 
481 

Gin Glu Glu Gin Ala Trp Val Gin Leu Lys Glu Gin Glu Val Leu Lys 
350 355 360 

CTG CAG GAG GAG GCA AAA AAC TTC ATC ACT CGG GAG AAC CTG GAG GCA 
529 

Leu Gin Glu Glu Ala Lys Asn Phe He Thr Arg Glu Asn Leu Glu Ala 
365 370 375 380 

CGG ATA GAA GAA GCG TTG GAC TCT CCG AAG AGT TAC AAC TGG GCC GTC 
577 

Arg He Glu Glu Ala Leu Asp Ser Pro Lys Ser Tyr Asn Trp Ala Val 
385 390 395 

ACC AAA GAA GGG CAG GTG GTC AGG AAC TGAGCACAGA GACTTCTGGG 
624 

Thr Lys Glu Gly Gin Val Val Arg Asn 
400 405 

GGCCCAAATA AGCACAGTGC TTGCCTAGGG TCTGTGTACT GGGATAGGAA 
TTGGTACATC 684 

CCAGGAGGAT GGCTCAGCCG TTTCCAGAGC AACCTCAGTC ACTCCAGGCT 
CGGCACTCAC 744 

CACCTGACTG GGAACTCCCA GATGTCCCTG TTCTGGCACC ACAGTCAAAC 
TGAGGGCAGC 804 

ATTAAACTCT GGGAAGTTCC TATCGCACAG AGGATCGGAC TGGACTGTGT 
CCCTCTAGAA 864 
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GCCAAGCTTG TCTTGTAAGT CTCTTGGAGT CCTGTGAGCC AAATGTTTCC 
TGCTTTTATA 924 

AATAAAGTAT TGGAGCCCA 943 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 200 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Leu Arg Ala Leu Asn Arg Leu Ala Gin Arg Pro Gly Asp Arg Pro 
15 10 15 

Pro Thr Pro Leu Leu Leu Pro Val Arg Gly Arg Lys Thr Arg His Asp 
20 25 30 

Pro Pro Ala Lys Ser Lys Val Gly Arg Val Gin Thr Pro Pro Ala Val 
35 40 45 

Asp Pro Ala Glu Phe Phe Val Leu Thr Glu Arg Tyr Gly Gin Tyr Arg 
50 55 60 

Glu Thr Val Arg Ala Leu Arg Leu Glu Phe Thr Leu Asp Val Arg Arg 
65 70 75 80 

Lys Leu His Glu Ala Arg Ala Gly Val Leu Ala Glu Arg Lys Ala Gin 
85 90 95 

Gin Ala He Thr Glu His Arg Glu Leu Met Ala Trp Asn Arg Asp Glu 
100 105 110 

Asn Arg Arg Met Gin Glu Leu Arg lie Ala Arg Leu Gin Leu Glu Ala 
115 120 125 

Gin Ala Gin Glu Val Gin Lys Ala Glu Ala Gin Arg Gin Arg Ala Gin 
130 135 140 
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Glu Glu Gin Ala Trp Val Gin Leu Lys Glu Gin Glu Val Leu Lys Leu 
145 150 155 160 

Gin Glu Glu Ala Lys Asn Phe He Thr Arg Glu Asn Leu Glu Ala Arg 
165 170 175 

He Glu Glu Ala Leu Asp Ser Pro Lys Ser Tyr Asn Trp Ala Val Thr 
180 185 190 

Lys Glu Gly Gin Val Val Arg Asn 
195 200 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2852 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: Rat 5'OT-EST-xdel 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION:1026..1270 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 1799..2235 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1030.. 1152 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

TGACCTCTGT GGATCTGATA TACATGTAAG TGACAGACCA TCCGAGCTAT 
ATAGTGAGAC 60 

CTGTGCAAGG AAGGATGGAG TGCACGTTCC CTGATGTTCA GAGCAACCCT 
GTGTCACTCC 120 

AGGTAGGTGA GATGAGAGGA AGAGGGTGGC CTTGGCCTGG GCCTCCTACG 
GGCCTGGAAG 180 

TTGGGAGAAG GATGTAAGCA GACTCTGTTC TCTTCTGAGA AATATCAGGT 
ATTGCAGTCA 240 

GCCCAGGCTC CTCAGACCCT CCTAAGTGCA GATTCTCTGC AGAATCTGGT 
GTTGACAACA 300 

CTAATGAGTA GGATGAGACT TCAGTTCCCT AGCCCTCACC GTCAGCTTCT 
GATTACCAAC 360 

AACTCTCCCA GAGGAGAGCC ATCTACCTTT GGGACAGATG CTCTCTGCCC 
TGCACTGCCT 420 

CCTGTTTCTC TTCATTGTAG AGGAAGATAG TACTTTAAAA GCTTCATAAA 
TGGTCTCAAG 480 

GTGGGAAGAC CCCGGCTCAG GTGAAAGAGG ACAAGCGTCA CCTCACACAG 
GCCACCCAGT 540 

AGAAAACAAG TGATCACTGA TACTGAGAAC TCTGGCAATT GCAGAGCTGC 
CCAAGACCAC 600 

AACAGGGCAG TGCAATGCAA GGAAAAGGTT TGTTGCTCGA TTGCAAACCT 
AAAGTTTAAA 660 

GTGCATCAGG AGAACGCTTA CTCAAAGAGG AAGTGTAAGC CTAACTTAAG 
TAGCTAGAAG 720 

CTCAGAATTT CTTGCATCAG CCCTGGAAGG GTACACAGGC CACCGGTGGG 
CCAGAGAACC 780 

ACACGCTTTG GGGCGGTGTC CAAGCTTGTG AACAAGTAGG CAAGAGCGCC 
TGGTGTTGTA 840 
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GCTGTCATTG GCGGGCAATA CAGCCCAGCG AACTGTGGTC TCCAAGGTGC 
CCCTCGACCC 900 

TCCCACTCTA CCCGAGACTC CAGGGACGCG ATGGGCCAGA CAGCAAGAGC 
TCCGCCTACG 960 

GGGGCGGGGA CAGGAGATTC CCGTGATGCT CCTCGACCAC TTCCGGACAG 
GGCGCAGGCG 1020 

CTAGCTGTC ATG TTG CGG GCT TTG AAC CGC CTG GCC GCG CGG CCC GGG 
1068 

Met Leu Arg Ala Leu Asn Arg Leu Ala Ala Arg Pro Gly 
205 210 

GGC CAG CCC CCA ACC CTG CTC CTT CTG CCC GTG CGC GGC CCA CGG CCC 
1116 

Gly Gin Pro Pro Thr Leu Leu Leu Leu Pro Val Arg Gly Pro Arg Pro 
215 220 225 

CGC TCA TTC TCG GCT CCT TTT TCC TCG CAG GAT AGC TAGGTTGCAG 
Arg Ser Phe Ser Ala Pro Phe Ser Ser Gin Asp Ser 
230 235 240 

CTCGAAGCAC AGGCCCAGGA GCTGCGGCAG GCTGAGGTCC AGGCCCAGAG 
GGCCCAGGAG 1222 

GAGCAGGCTT GGGTGCAACT GAAAGAACAA GAAGTTCTCA AACTGCAGGT 
GGGCCGAGGT 1282 

CGTGAGGAAT GTGGGTATTG GAGATTCCGG TGAGGGAGGC TCTGGGGAGA 
GCAGCACAGG 1342 

GTGTCAAGTG ACCAGTCTTC AGGAGGCTTC TCTCTCTGCT CTGCACACAC 
AGAGTGCCTC 1402 

CCAGACAATG GTCAATGAAA GGTTACAGGC TAGTATTGCC GTGTGAAACT 
TGAAGGTCAG 1462 

GGAAACCATA AATGAGAATG GAGCTGTTTT TATTGTGTAA GGGAGAGTGA 
CAAGGTTGAG 1522 

AGAGTCCACC ACCCCGCACC TCCCCCCGCC CCCAATCAGG TTGTCACGAT 
TCGATTCGTT 1582 
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CTTGGGTTGT GGCTGAGAGA TCTGATGGGT AATTGTCCGA GGAAGAGGGA 
TATAATGGTT 1642 

GAGGTCACCT AGTACAGTTG TGCTGGCCTA TTGGTGGGAC ACTCAAAGGG 
GCCCTGGGCT 1702 

CTTTTGACAC CCTTCTTAAG GTGGGCTAGA GACAGTAAGT TATGCAGGCA 
GCCAGCTCTG 1762 

AGAGATCCCA CGTAGCTAAC CTTTCTCTTC CCGTAGGAGG AGGCCAAAAA 
CTTCATCACT 1822 

CGGGAGAACC TGGAGGCACG GATAGAAGAG GCCTTGGACT CTCCGAAGAG 
TTATAACTGG 1882 

GCGGTCACCA AAGAAGGGCA GGTGGTCAGG AACTGAGAAC AGAGGCCTCT 
CAGGCCCAAA 1942 

TAAGGACAGT GCTTGCCTAG GGACTGGATA TTGGGGTAGA AATTGGTGCA 
TCCCAGGAGG 2002 

GTGGCACAGC CTTGTCCAGA GCAGCCCCCA TTCATTCTAG ATTTGGCACC 
AGGTATAGTA 2062 

CCTGTTCTGA CACCACATAC AAACTCCGGA CAGCATTAAA CTCTGGGAAG 
TTCCTATCAC 2122 

ACAGAAGATC AGACTGGACT GTCCCCTCTA GAAGCCAAGA GCTGTCTCCT 
GAGTTTCTTG 2182 

GAATAGTGTG AGCCCAATGT TTCCTGCTTT TATAAATAAA CTATTGGAAA 
GCAAAGCCTT 2242 

TTGTTATGTG GCTTGCTTTT TCTTGTTGTA GAATAAGTTT ATTTGTCCCA 
GTTATTTGGG 2302 

TCTTAAGGTT ATTAGCCAAA AGCCAGTTCA CCTAACTGAG CCAGGAGTTA 
GTTATCTGCT 2362 

TTGCTCAATC CTGGGCTTTG CTGGGTAGGG TCAGGTGTGT CCAAGGTCCA 
GAAAGCAAAA 2422 

AGGGTGCCCC GTTTCTCCTG GGAAGGCTTC CCCGTCAGTG ATTTCTGTAA 
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CCGGACCCTG 2482 

CCCTGACACA GCGTCATTGG ACTACCCAGC AGACAGTAGA CTCCACTCTA 
AACCCGCTTC 2542 

TTGCGGTCAG TTGCTGTCCT TCAGTGTGTG TAAGCAGTGG CCAGACAGCA 
CCCTTGGGTG 2602 

TCATTTCAAG ACTCTCTCAC CTTGGTCTGC TTTACGTTTG GTTTGATTTG 
GTTTGTTCTG 2662 

GTTTTTGAGA CGAGGCCTTT CACTGGAACC TGGCACTCAG TATTTAGACT 
GCCCAGCCAG 2722 

CTAGCCTCAG AGAATGCATC TGCGTATGCT TGCCTGGCGC TGGAATTCGG 
TGCACATGGC 2782 

TTTGATGTGT ACCGGGGATC AGACACAGAT GTTTCATGAG TGCAGTGCAT 
GCCTGTTAGT 2842 

GGTAGAGCTC 2852 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Leu Arg Ala Leu Asn Arg Leu Ala Ala Arg Pro Gly Gly Gin Pro 
15 10 15 

Pro Thr Leu Leu Leu Leu Pro Val Arg Gly Pro Arg Pro Arg Ser Phe 
20 25 30 

Ser Ala Pro Phe Ser Ser Gin Asp Ser 
35 40 

(2) INFORMATION FOR SEQ ID NO: 9: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic Primer" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
TTCACACCAC TCTGTCGAAC 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic Primer" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
AGGAGGAAGA CAGGTGAAAG 
(2) INFORMATION FOR SEQ ID NO: 1 1: 



20 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic Primer" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 1 : 
TCATGTTGCG GGCTTTGAAC 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic Primer" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TCTTTCAGTT GCACCCAAGC 
(2) INFORMATION FOR SEQ ID NO: 13: 



20 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic Primer" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GTGATAGGAA CTTCCCAGAG 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic Primer" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GCCTCGTGCA ATTTCCCTCG CACCTCCAAT GTGAACTCTC GC 
(2) INFORMATION FOR SEQ ID NO: 15: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic Primer" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TCCTGCGAGG AAAAAGGAGC CGAGAATGAG CGGGGCCGTG GG 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3264 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE: 
(B) CLONE: Rat 5'OT-EST 

(ix) FEATURE: 

(A) NAME/KEY: exon w 

(B) LOCATION:1026..1241 

(ix) FEATURE: 
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(A) NAME/KEY: exon x 

(B) LOCATION-.1332..1478 

(ix) FEATURE: 

(A) NAME/KEY: exon y 

(B) LOCATION:1559..1682 

(ix) FEATURE: 

(A) NAME/KEY: exon z 

(B) LOCATION:2211..2647 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

TGACCTCTGT GGATCTGATA TACATGTAAG TGACAGACCA TCCGAGCTAT 
ATAGTGAGAC 60 

CTGTGCAAGG AAGGATGGAG TGCACGTTCC CTGATGTTCA GAGCAACCCT 
GTGTCACTCC 120 

AGGTAGGTGA GATGAGAGGA AGAGGGTGGC CTTGGCCTGG GCCTCCTACG 
GGCCTGGAAG 180 

TTGGGAGAAG GATGTAAGCA GACTCTGTTC TCTTCTGAGA AATATCAGGT 
ATTGCAGTCA 240 

GCCCAGGCTC CTCAGACCCT CCTAAGTGCA GATTCTCTGC AGAATCTGGT 
GTTGACAACA 300 

CTAATGAGTA GGATGAGACT TCAGTTCCCT AGCCCTCACC GTCAGCTTCT 
GATTACCAAC 360 

AACTCTCCCA GAGGAGAGCC ATCTACCTTT GGGACAGATG CTCTCTGCCC 
TGCACTGCCT 420 

CCTGTTTCTC TTCATTGTAG AGGAAGATAG TACTTTAAAA GCTTCATAAA 
TGGTCTCAAG 480 

GTGGGAAGAC CCCGGCTCAG GTGAAAGAGG ACAAGCGTCA CCTCACACAG 
GCCACCCAGT 540 

AGAAAACAAG TGATCACTGA TACTGAGAAC TCTGGCAATT GCAGAGCTGC 
CCAAGACCAC 600 
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AACAGGGCAG TGCAATGCAA GGAAAAGGTT TGTTGCTCGA TTGC AAACCT 
AAAGTTTAAA 660 

GTGCATCAGG AGAACGCTTA CTCAAAGAGG AAGTGTAAGC CTAACTTAAG 
TAGCTAGAAG 720 

CTCAGAATTT CTTGCATCAG CCCTGGAAGG GTACACAGGC CACCGGTGGG 
CCAGAGAACC 780 

ACACGCTTTG GGGCGGTGTC CAAGCTTGTG AACAAGTAGG CAAGAGCGCC 
TGGTGTTGTA 840 

GCTGTCATTG GCGGGCAATA CAGCCCAGCG AACTGTGGTC TCCAAGGTGC 
CCCTCGACCC 900 

TCCCACTCTA CCCGAGACTC CAGGGACGCG ATGGGCCAGA CAGCAAGAGC 
TCCGCCTACG 960 

GGGGCGGGGA CAGGAGATTC CCGTGATGCT CCTCGACCAC TTCCGGACAG 
GGCGCAGGCG 1020 

CTAGCTGTCA TGTTGCGGGC TTTGAACCGC CTGGCCGCGC GGCCCGGGGG 
CCAGCCCCCA 1080 

ACCCTGCTCC TTCTGCCCGT GCGCGGCCGC AAGACCCGCC ACGATCCGCC 
TGCCAAGTCC 1140 

AAGGTCGGGC GCGTGAAAAT GCCTCCTGCA GTGGACCCTG CGGAATTGTT 
CGTGTTGACC 1200 

GAGCGCTACC GACAGTACCG GGAGACGGTG CGCGCTCTCA GGTGTGTGTA 
AAGGGCAGGC 1260 

GGCCTTCGGC GCCCCCTGGG AAGTGCTGGG GCTGGAGGAT GGGTGCTCAC 
TTGAAGCCCG 1320 

TCCTCACCCA GGCGAGAGTT CACATTGGAG GTGCGAGGGA AATTGCACGA 
GGCCCGAGCC 1380 

GGGGTTCTGG CTGAGCGCAA GGCGCAAGAG GCCATCAGAG AGCACCAGGA 
GCTGATGGCC 1440 

TGGAACCGGG AGGAGAACCG GAGACTGCAG GAACTACGGT GCGAGAGGCG 
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CGGGGCTGGG 1500 

TGGGCTGGGC TAGGCTCACC CACGGCCCCG CTCATTCTCG GCTCCTTTTT 
CCTCGCAGGA 1560 

TAGCTAGGTT GCAGCTCGAA GCACAGGCCC AGGAGCTGCG GCAGGCTGAG 
GTCCAGGCCC 1620 

AGAGGGCCCA GGAGGAGCAG GCTTGGGTGC AACTGAAAGA ACAAGAAGTT 
CTCAAACTGC 1680 

AGGTGGGCCG AGGTCGTGAG GAATGTGGGT ATTGGAGATT CCGGTGAGGG 
AGGCTCTGGG 1740 

GAGAGCAGCA CAGGGTGTCA AGTGACCAGT CTTCAGGAGG CTTCTCTCTC 
TGCTCTGCAC 1800 

ACACAGAGTG CCTCCCAGAC AATGGTCAAT GAAAGGTTAC AGGCTAGTAT 
TGCCGTGTGA 1860 

AACTTGAAGG TCAGGGAAAC CATAAATGAG AATGGAGCTG TTTTTATTGT 
GTAAGGGAGA 1920 

GTGACAAGGT TGAGAGAGTC CACCACCCCG CACCTCCCCC CGCCCCCAAT 
CAGGTTGTCA 1980 

CGATTCGATT CGTTCTTGGG TTGTGGCTGA GAGATCTGAT GGGTAATTGT 
CCGAGGAAGA 2040 

GGGATATAAT GGTTGAGGTC ACCTAGTACA GTTGTGCTGG CCTATTGGTG 
GGACACTCAA 2100 

AGGGGCCCTG GGCTCTTTTG ACACCCTTCT TAAGGTGGGC TAGAGACAGT 
AAGTTATGCA 2160 

GGCAGCCAGC TCTGAGAGAT CCCACGTAGC TAACCTTTCT CTTCCCGTAG 
GAGGAGGCCA 2220 

AAAACTTCAT CACTCGGGAG AACCTGGAGG CACGGATAGA AGAGGCCTTG 
GACTCTCCGA 2280 

AGAGTTATAA CTGGGCGGTC ACCAAAGAAG GGCAGGTGGT CAGGAACTGA 
GAACAGAGGC 2340 
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CTCTCAGGCC CAAATAAGGA CAGTGCTTGC CTAGGGACTG GATATTGGGG 
TAGAAATTGG 2400 

TGCATCCCAG GAGGGTGGCA CAGCCTTGTC CAGAGCAGCC CCCATTCATT 
CTAGATTTGG 2460 

CACCAGGTAT AGTACCTGTT CTGACACCAC ATACAAACTC CGGACAGCAT 
TAAACTCTGG 2520 

GAAGTTCCTA TCACACAGAA GATCAGACTG GACTGTCCCC TCTAGAAGCC 
AAGAGCTGTC 2580 

TCCTGAGTTT CTTGGAATAG TGTGAGCCCA ATGTTTCCTG CTTTTATAAA 
TAAACTATTG 2640 

GAAAGCAAAG CCTTTTGTTA TGTGGCTTGC TTTTTCTTGT TGTAGAATAA 
GTTTATTTGT 2700 

CCCAGTTATT TGGGTCTTAA GGTTATTAGC CAAAAGCCAG TTCACCTAAC 
TGAGCCAGGA 2760 

GTTAGTTATC TGCTTTGCTC AATCCTGGGC TTTGCTGGGT AGGGTCAGGT 
GTGTCCAAGG 2820 

TCCAGAAAGC AAAAAGGGTG CCCCGTTTCT CCTGGGAAGG CTTCCCCGTC 
AGTGATTTCT 2880 

GTAACCGGAC CCTGCCCTGA CACAGCGTCA TTGGACTACC CAGCAGACAG 
TAGACTCCAC 2940 

TCTAAACCCG CTTCTTGCGG TCAGTTGCTG TCCTTCAGTG TGTGTAAGCA 
GTGGCCAGAC 3000 

AGCACCCTTG GGTGTCATTT CAAGACTCTC TCACCTTGGT CTGCTTTACG 
TTTGGTTTGA 3060 

TTTGGTTTGT TCTGGTTTTT GAGACGAGGC CTTTCACTGG AACCTGGCAC 
TCAGTATTTA 3120 

GACTGCCCAG CCAGCTAGCC TCAGAGAATG CATCTGCGTA TGCTTGCCTG 
GCGCTGGAAT 3180 

TCGGTGCACA TGGCTTTGAT GTGTACCGGG GATCAGACAC AGATGTTTCA 
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TGAGTGCAGT 3240 

GCATGCCTGT TAGTGGTAGA GCTC 3264 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44576 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Cosmid DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GCGGCCGCAT AATACGACTC ACTATAGGGA TCTGGTGGAG GACCTATGGC 
CCGCGAGCTA 60 

GAGAAGTGGT TCTCAACCTT CCTAGTGCTG AGACCCTTTA ACACAGTTCC 
TCGTGTTGTG 120 

GGGAAACCCC CTCCTGCAAC CATAAAATAA TTTTTGTTAC TACTTCATAA 
CAAGTGTTGC 180 

TACTCTATTG CTATGAATTG TAAAATAAAT GTGTCTTCCA ATGGTCTTAG 
ATGACTCCCG 240 

TGAAAGGGTC ATTCTACCCC TAAGAGGTCA TGATCTACAG GTTGAGAACC 
ACTGATCTCC 300 

AGTAACCTTC ACTTGAGTCC ATATCCTCCA TGAAGGTATG GAAGTCAATA 
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AAACTGAGCT 360 

TCAAGCCTCA TCAAAATGGG TCCATCCCCT GGTACAGTGT GAGTGGAAGA 
ATACCCACCA 420 

TACGGTCACT GGAAGGAGGA TGTCTGAAGG GTCTTAGATT GTGTCAAGGG 
GTCCTGGGTG 480 

TCAGGATCTG ACGAAGCAGG CTCGTCATGT TTCATGAAGA CTACAGGTAT 
GTGATAAAAC 540 

TGCAAGCTGG AAAAGTACCC ACTGAGCCCG TGTGGCTCTG CTGGGATTTG 
GAGGCATGAG 600 

GAGCAGAGGG TCTGGAGGAC AGCAGTCCCA GAAATAATCT ATGACTAAGA 
AGGCTGAACT 660 

GGGGTGACTC TCTGGTGGAA AGAGTTGCCT TTTAAGAAGG AAGACATACC 
AGGCATAGCA 720 

ACAACTGCCT TTAGTACTAG CACTCTGAAG GCAGAGGAAG TCCGATTTCT 
CTGAGTTCCA 780 

AGCCAGCTTG GTTTACACAG CAAGTTCTAG GCCAACTAGG GTTACATAGT 
GGACTCTCCT 840 

CAAACGGGGT TGAGAAAGGA CTCAGCAGTT AGCTCAGTTA ACTCCAGTTC 
TAGGAAATAT 900 

GATCCCTTAG TCTGACCTCT TGGCATGTAA GTGGTGCACA TACATATATG 
CACACAAAAT 960 

ACATCAATCT GCAAAGGGGG AGGGAGGAAG GGCTGGAGTC TGAAGAAATA 
GTTCAGTGGT 1020 

TAAGAGAATT CACTGCTCTT CCCAATAGCC AAATTCAGCT CCTAGCATCC 
ATGTCAGATG 1080 

GCCCACGAAC ACCTGTAATT CTAGCCCCTA AACTCAGTGC CCCTTCACAA 
GACGGGGACA 1140 

CACGTACACA TATACCTAAA AAATTAGGTG GTTTTTTTTT ATTTATAAGG 
TCAAATGCAG 1200 



27 



AATATCAAAT GGGTTAGACA GCAGCTCCAA GCTGGCCTCT TCCTCCCAGG 
GCTCTTCTTG 1260 

ACTCTTGGCA CCCTCTTTGG GTCCAGAACC CAGACATTAG CCATGACTCA 
GCTGATAAAA 1320 

TGCAACCCAT GGCTCATTAA TTAGGAAGTC TGTAATTAGC CTGTCTGGTA 
GCCTCCAGAG 1380 

AGAACCCCTT TCACCTGTCT TCCTCCTCTC ACCCAGGGGA AGAGCTCAGT 
TTTGCCCCTG 1440 

AGACAGAAGA AGGGAACGAG ACCATGAGCA ACGGGAAATG AGATGCTGGC 
GCACACACAC 1500 

TTTATGTGTG TGAAGTCTCA GAGAGGTCAC CAATAATGAG GCAATGGAAA 
TGAGCTGAGC 1560 

TGCCTGAACC TCCAAGTTTC CTCCAAGAAA ACCCCACAGG GGAGATGGGG 
CATGGCCCAG 1620 

GCCAGCTGCC CCAGCCTCTG CTGGCAGAAA GTGAGCCCGC TGCCATTTTA 
ATTTTTGATA 1680 

CAGGGTCTCA CTCTACAGCT CTGGGGGCCT AAAACTCACT ATGTAGACTT 
CAAACTCAAC 1740 

CAAACCAACA ACAAAAACAA ACAAAACCCC TGCACTGACT GGAGAGATGG 
CTCGGTTGAG 1800 

AACAATGGCT GCTAGGAGTC AAACCCAGGT CCTGTGGAAG AGCATGCTGG 
TAACTGCTGG 1860 

GTCATCGCTG GGTCACTCTC TTCACACACA CACACACACA CACACACACA 
CACGGCAATG 1920 

AACTCTTCAG TGTCTTGATT TACGGTTTCT TCCGATAAAT CCTCAGGAGG 
GCAGTCAAGT 1980 

GGCTCATTTG GCAAATGCTT GCCTGAGACC TGAGTTTGGT TCCCAGAACC 
CATGGAGGCA 2040 

GAAGGAAAGG GCTCCACAAA GCTCTCTTCT GAACTCCATA TGTGCACACA 
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CACCCACTTC 2100 

GCACAC ATTC ATAATAGTGA TGAATGAAAA TGAAGACAGA TAAAAAAAAC 
CAATTTCGTG 2160 

AAACTGTTAG CACGTTCAGT CAATGGCTTT GGGGGTAACC TGTTTCAGAG 
CCATGGTACT 2220 

CAGTCACTAG GCTCATACTG GTCAGACGCT GAGGTCAGCA ATGGAGAGCT 
GCTACACCTA 2280 

AAGGTAGCAG AGGTCATTTG GCTCTGACTC AGAATATTCC AGCTCTCCAC 
ATTCACAGAA 2340 

GTTCTACTTG GTCGTAGAAA AAAGCTGAGC CTTTTTTTTT TTTTGGAACT 
TTATTTTTTT 2400 

AAAGATATAT TTATTTTATG TATATGAGTG CACTGTAGCT GTCTTCAGAC 
ACACCAGAAG 2460 

GGGGCATCGG ATCCCATTAC AGATGGCTGT GAGCCAACAT GTGGTCGCTG 
GGGATTGAAC 2520 

TTAGGACCTC TGGAAGAGCA GTCAGTGCTC TTAACCGCTG AGCCATCTCT 
CCAGCCCTGG 2580 

AACTTTATTT TGAACATGCA ACCCCACCTA CCACTATGGG TTCAGTCACC 
AGCGCCTTAG 2640 

GAATAAAATT GGAGAAAATA AGCTTTATGG TTAGTCAGCT GTCAGCTGTG 
GGGTTGGGGA 2700 

CAGAAGAATG GTTATGTTTT GTTTTCCCAT CAAGGCCTCA CTCTGTGACC 
TGGTTGGTCT 2760 

GCCACTTGCT CTGTAGATCA GGTTTCAATT ACAGAGATCC ACCTGCTCCG 
TGTCGCTATG 2820 

CTGGGATAAG AACTAAGTCA CCCTGCCTAC CTTATTACTT TGTATTCTTG 
GGCATGGAAC 2880 

TCAATTCCTT GTCAGCGAGA GAATAACTTC CTCGATCGGA GTGTTTTTAT 
GTGAATTGGG 2940 
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CCAAAAAGAC TGCGATGCTC TGAGACCTAT TTGTGAAGCC AAGAGTAGTG 
GGTAGCACAA 3000 

GTCAGAAATC CAAGGACTTG GTAAGCTGAG ACAGTAGGAT GTGTGCGCTC 
ATGCACACAC 3060 

ACACACACAG ACACACACAC AGACACATAC ATGCATGGAC GCACAGAGGC 
ACCCACGCAC 3120 

ATGTGCCTGG ATGAGGCTTC AGTTCTTCAT AAAGCTGCCT TTGAGTTTGT 
GCCCTCCCAC 3180 

TCTTCCTGAG GACTGGAGTC CTCACACCTT GGGCTGATAG TGCACCACTA 
CCTTTTTTAG 3240 

TGACCTCCTC TTTGCAGTCA CAGGCTGAAG GTACAGGGAG GACTCTAGCG 
GCCGTCTGCC 3300 

TCTGTTTAAC ATGAACCTGC AAGGCAGTGG GCAGCCTCAC CCCTAGCGAT 
GGCACTGAGT 3360 

GATGCCAGGA ACGCTGTCCT CATGTGCCCT TGGCTGTTGG GGCACAGTGT 
GCCTCTGCAG 3420 

GGCCAGCCTG ACCGTGTGTG CCAGCCAGAA TGCACAATTT CTGCCCGACC 
TTGGAAGCTT 3480 

TTTGTCTTTC CTTGTGAGTT TCTTGTCACC CAGCAGTGTT TCTTGCCTCT 
TTGCTTGACG 3540 

CCTCTATGGG AAGATGGACA AGACTTTTTT TTTTCTACAT CCCCTGCAAA 
CAGGTTTGTC 3600 

ATACCTCTCA GGGGCAGGGG TCTTGTCCCT GTCAAGCGCA GCAGGCCACC 
AGACCCAGAA 3660 

CTATGAAATC TACCCAACTT GTCTCTGTAC AAAGTTAAAC AACAAAAAGA 
AACTTGGTTT 3720 

TGTTTTTGTT TTTTTTTTTG TTTTGTTTTG TTTTGTTTTT TGAGACAGGG 
TTTCTCTATG 3780 

TAGTCCTGGC TATTCTGGAA CTTGTTCTAT AGACCAGGCT ATCCTGGAAC 
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TCAAAGAACG 3840 

GCCTGACTCT GTCTCCCGGG TGCTGGTCAC TCTGAAGATC TGTGCCACCA 
TCATCAGGCT 3900 

GGGTTTTAAA AGATTATGGT TTATATTTAA TGTGTATGAG TGTTTTGTTT 
GCATGTATAT 3960 

CTGTACATGA CAGGTGTGCC TGGTGTTTAC AGAGGCCAGA AGAACATACC 
AGATCCCCCT 4020 

GGAACTGAAG TTACAGACAG TCGTGAGCCA TCTCGGGGTT GCTGGGGACA 
GAATCCGAGG 4080 

GCTCTTCTTG AGTAGCAAGT GCTTCTAACC GCTTAGGCCT CTCTGCAGCC 
CCCACTTACA 4140 

GGATTTAAAG GTAGAACAAG GTTTGTCACC TGTCCTGGAG ACCCTGGCCT 
TTAATTCCAG 4200 

AACTCTGGAG GTAGAGACAG ATGATTCTCT ATGAAGTTCA GGCGAGCCTG 
GTCTACACAG 4260 

AGTGCCGCAT GATAGCAAGA AGAAGATCCT GTCTTTAAAA GAGACGAGAG 
GGGTTGGGGA 4320 

TTTAGCTCAG TGGTAGAGCG CTTGCCTAGC AAGCACAAGG CCCTGGGTTC 
GGTCCCCAGC 4380 

TCCGAAGAAA AAAAAGAGAC GAGAGCCAGT GGTTGGTGCA CGTCTTTGAT 
CCCAGTACTC 4440 

TGGAGGCAGA GGTAGTGGAT CTCTCTTGAG TTCAAGGACA GCGTGGTCTA 
CAAAGTGAGT 4500 

TTTAGGACAT CCAGGATTAC ACGCACAGAA ACCTTGTCTC ATAAAACAAC 
AAACAAGACA 4560 

AGACAGAAAC TCTCCTAACG TAGACCGCCA CACCTGATTT TTAAAAGCTC 
TCAGTGAAAC 4620 



TGAGCATGGT AGCACATGTT TGTAATCCCA GCAGACATGT GGGGAGACAA 
AGGAATGGAC 4680 
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TCAGACTCAG CCGGAGAGCA AGTTCACGGC TAGACTGGAC CATTCCTACA 
ATGAGGTAGG 4740 

AATTGGGGTT AGCACATCAA GTAAGTAACC CTGGAAACAA GTTTGACTTG 
TCCAAGGTCA 4800 

CACAGCAATG TCTGGAAAGC TAAGTCTGGT TCCAAGGCCC CCCCTTCCTC 
CCTCTCTCCC 4860 

TCTCTATAAT TGAAAAGTCC ACTGCTTGGC AAAAACTCCC AGGACTATAT 
TAAACACAAA 4920 

TGCTGGTGTT CTCCATGTCT TAGGGCTTTT ATCCTAGAAG GAATTCAAAC 
ACACAACACG 4980 

AATACCCCAC AGAAAGGAGG GCAGGGTGGA GGGGTAAGGG AGAGAGGAGG 
AACTTCAGGC 5040 

TACTGGGGGT ATTAACCAGC TCTGTACCCC ATCCACACAG ACCCAAGTTA 
GAAAAGAGCA 5100 

GGAGAGGGGG TCTGGAGAGG TTGTTAACTG GCCCAGCAGT TTGGCCTGCT 
CTTGCAGGGG 5160 

CCCAGCTCTG TTCCCAGCAC CCATTTCAGT GGCTCACAAC TTTTAACTCC 
AGCCCCAAGG 5220 

ACTCTGCTTC CCTCTGAGAG CTCTGTACTT AACAGGGACA CACAGACACA 
TACAATTAAA 5280 

AAAATGTTTT AAAGTGAGAG ACGCTCTAGA CAGGCTAGCA AGTATTGAGT 
TGTGGCAGGT 5340 

ACAGCTATTT TAATAGTGAT TTCAGGTTAG AACCCTGGGG AGGGGGAACC 
AGGAGTTAAA 5400 

CTATGTTAAA TCAGAAAGAC CCAAAGCCAA TCTGGTGGAA GCTGCCATTG 
GAGGTTCTAA 5460 

CAAGTCTGGC TTGTCAGGGA AAGGCTCAGA ATGAAGGTTT GAGCTGGGGC 
ATCATTAGTG 5520 

TATAAAAAGT ATGAAAACAC TCTAGGAAGA AGACAAGAGG AGGAACACCA 
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CGGAGAGCGA 5580 

GCCTTACGAT GTTCCAGCAC GTAGACGCCA AAGTGAAGCC AGGAAACCAA 
GCACAGGGAC 5640 

CAGGAAAGCC CAAAGTTCAT TGTGAAAAGG ACAAAGGCTT CATCCTGGGA 
AACTAGGCTG 5700 

GGAGAGGCCG TGTTAAATAA AGACAGACAC ACCCATCAAA ATGACCCACA 
GAGGGCTTCA 5760 

TGATTACAAT AGTATTTCAT AGGCGGATTT GGGCAGAAAT CTGAATGCAG 
GGGATTACAG 5820 

AGTAAATGCT GACTTTTGGA TAAGAATGGC AGATCACAGG ACAGGTGTGT 
GACTCACATC 5880 

TTTAAAGCAC ACTCCCAGGG CAGAGGTAGT GAGTTTGAGT TTAGGGGCTA 
GTCTGGTCTG 5940 

GACTGGAAAT TCTATGAGAC CCTGTTCTCA AAAACTAAAG TATTTGGGAA 
AAAAGAACTT 6000 

CTGAGGGAAA TGGAGGCCGT GTAGGTCTCT CTGGGAGCCC GTGCGGCAGG 
TGGCGAGGGA 6060 

GGATCTGAAA TGGGGAGAGT CAGCAGACTG CTGGACCTTT CCTAGCCAGC 
AGAGATGCTA 6120 

AGGCAGGTGA AGATTAGGTC TCATGGACCT GACACCCGTG CACACAGGCA 
GCATGGCGCC 6180 

TTCAAAGCTC TAGTGGATGT GATTGCCCCA GACAAGTCTG CCCCAAAGCT 
CATCTTCGTC 6240 

CATTAATAGA AAAAAGGTTT CTTCTGACCA AGGAAGCTGT TCTCTCTGGA 
AAACAATCAC 6300 

TTAACAAGGA CATTACTAAC ACGAAGCTGC TGTCCGATCA CATCACCATG 
ACGCAAGCAC 6360 



TTCCCTTGGG GTTCATACGC AGTGACTCAG TGCTCACGAC CCTGTGCTAG 
GCTTGGCCCT 6420 
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CACTCCTTTT CCGCTGGAAT TAAGTGGGGA GTCAGACACC CCAGAGGACC 
TGCCCAAGCC 6480 

AGAAAGCTTC AAGCCACAGG AGCCAGTGTG TCCTTGGCTT CCCTACACAT 
GAGCTGTCTC 6540 

TTATCCTCGA TCGAGGGCCT CACAGTCATT CCTGAAAAGA TCTGGCCCCC 
AGCCCTGAGT 6600 

ATGGAAGGCT AACTTGGCTA CCAGTCCCCA CTGTCCTTAT TAGGAAGAGG 
CAAAACCGTC 6660 

CTCTGGCACT CTCTTGAAGC ATACTGGTAT ATCCGAGAGA GGTAACAGGA 
GCCGATGGGA 6720 

GCTGGGAGGG TCCTGGCCTA GGCATAGTCT AGAAGACTTG GGCTAAGTAG 
TCTGGGTCCC 6780 

CAAACCATAA CATTTTTCTG GTGACTAAAG AAAAGGAGTC TGTAAGCCTA 
AAGCAGAATG 6840 

TGGTGATACA CGCCTACAGT CCTAGCACTG GAGAGGTGGA GATAGAAAGA 
TCAAGAGTTC 6900 

AATGCCAGCT TTCTGCTATG TAGTAAGGTC AAGGTCAGCC TGGACTAAAC 
GACTGCCTTA 6960 

GAAACAACCA AATGACTTAC CGTCTAAAGT CAGGAACTAC ACTTGCTTTC 
TCAGACTGTG 7020 

TCTGTCTGTC TGGGGCTCCT CCCATTTCCT CTCCTAACAA CATCCACTTC 
CACTCCTGCC 7080 

TTAGATCTGA GATAGTACCA GCCTCAGGGC ATGGGGTCTC CCCATAGCTT 
TTCCTCTGCA 7140 

GTACTGTGGG CTCACCTAGG ACTGTTTCTG AACTATATCC TACCCTAGCT 
CTCTACCCTA 7200 

GAAGGCCTGA AACTCACAGA AATTCTCCTG CCTCTGCTTT CCAATGGCTG 
GGGTTAAAAG 7260 

CATGTGTCAC AACTGTCCTT TTTATTCTTT TAATATCGAG ACAGGGTCTC 
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ACCAAGTTGC 7320 

CCCAAGACGC CAGCCACACC TGGGACAGGG CAGGCCTTTG GCTCTATGTT 
CAGTCTTGAC 7380 

TCCATGACTG TGGCCGCTAG CCCATGAGGC TGCGCGTGGG AATTTCCTTC 
TGAAAGCTCA 7440 

CCTGGTATCG ATGCTTCCTC TTATCCTACA CCACAACTAA CAAACCTGCC 
CCACCTCCTG 7500 

GTCCTGACCC TGCTGCAGAC CTGCTAGTCC TTGGTGAATG AGACCTGGGG 
ACCCCTCTAG 7560 

TCTGTTGAGA GCTGCTGAAA TGCTCAACTA TGATTTCCAG GTGACCCTCA 
AGTCGGCTCA 7620 

CCTCCCTGAT TGCACAGCAC CAATCACTGT GGCGGTGGCT CCCGTCACAC 
GGTGGCCAGT 7680 

GACAGCCTGA TGGCTGGCTC CCCTCCTCCA CCACCCTCTG CATTGACAGG 
CCCACGTGTG 7740 

TCCCCAGATG CCTGAATCAC TGCTGACAGC TTGGGACCTG TCAGCTGTGG 
GCTCCTGGGG 7800 

AGCCACTGGG GAGGGGGTTA GCAGCCACGC TGTCGCCTCC TAGCCAAC AC 
CTGCAGACAT 7860 

AAATAGACAG CCCAGCCCGC TCAGGCAGCA GAGCAGAGCT GCACGACGCG 
TCGATCCCAA 7920 

GGCCCAACTC CCCGAACCAC TCAGGGTCCT GTGGACAGCT CACCTAGCTG 
CAATGGCTAC 7980 

AGGTAAGCGC CCCTAAAATC CCTTTGGCAC AATGTGTCCT GAGGGGAGAG 
GCAGCGACCT 8040 

GTAGATGGGA CGGGGGCACT AACCCTCAGG GTTTGGGGTT CTGAATGTGA 
GTATCGCCAT 8100 

CTAAGCCCAG TATTTGGCCA ATCTCAGAAA GCTCCTGGCT CCCTGGAGGA 
TGGAGAGAGA 8160 
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AAAACAAACA GCTCCTGGAG CAGGGAGAGT GTTGGCCTCT TGCTCTCCGG 
CTCCCTCTGT 8220 

TGCCCTCTGG TTTCTCCCCA GGCTCCCGGA CGTCCCTGCT CCTGGCTTTT 
GGCCTGCTCT 8280 

GCCTGCCCTG GCTTCAAGAG GGCAGTGCCT TCCCAACCAT TCCCTTATCC 
AGGCTTTTTG 8340 

ACAACGCTAT GCTCCGCGCC CATCGTCTGC ACCAGCTGGC CTTTGACACC 
TACCAGGAGT 8400 

TTGTAAGCTC TTGGGGAATG GGTGCGCATC AGGGGTGGCA GGAAGGGGTG 
ACTTTCCCCC 8460 

GCTGGAAATA AGAGGAGGAG ACTAAGGAGC TCAGGGTTTT TCCCGACCGC 
GAAAATGCAG 8520 

GCAGATGAGC ACACGCTGAG CTAGGTTCCC AGAAAAGTAA AATGGGAGCA 
GGTCTCAGCT 8580 

CAGACCTTGG TGGGCGGTCC TTCTCCTAGG AAGAAGCCTA TATCCCAAAG 
GAACAGAAGT 8640 

ATTCATTCCT GCAGAACCCC CAGACCTCCC TCTGTTTCTC AGAGTCTATT 
CCGACACCCT 8700 

CCAACAGGGA GGAAACACAA CAGAAATCCG TGAGTGGATG CCTTCTCCCC 
AGGCGGGGAT 8760 

GGGGGAGACC TGTAGTCAGA GCCCCCGGGC AGCACAGCCA ATGCCCGTCC 
TTGCCCCTGC 8820 

AGAACCTAGA GCTGCTCCGC ATCTCCCTGC TGCTCATCCA GTCGTGGCTG 
GAGCCCGTGC 8880 

AGTTCCTCAG GAGTGTCTTC GCCAACAGCC TGGTGTACGG CGCCTCTGAC 
AGCAACGTCT 8940 

ATGACCTCCT AAAGGACCTA GAGGAAGGCA TCCAAACGCT GATGGGGGTG 
AGGGTGGCGC 9000 

CAGGGGTCCC CAATCCTGGA GCCCCACTGA CTTTGAGAGA CTGTGTTAGA 
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GAAACACTGG 9060 

CTGCCCTCTT TTTAGCAGTC AGGCCCTGAC CCAAGAGAAC TCACCTTATT 
CTTCATTTCC 9120 

CCTCGTGAAT CCTCCAGGCC TTTCTCTACA CTGAAGGGGA GGGAGGAAAA 
TGAATGAATG 9180 

AGAAAGGGAG GGAACAGTAC CCAAGCGCTT GGCCTCTCCT TCTCTTCCTT 
CACTTTGCAG 9240 

AGGCTGGAAG ATGGCAGCCC CCGGACTGGG CAGATCTTCA AGCAGACCTA 
CAGCAAGTTC 9300 

GACACAAACT CACACAACGA TGACGCACTA CTCAAGAACT ACGGGCTGCT 
CTACTGCTTC 9360 

AGGAAGGACA TGGACAAGGT CGAGACATTC CTGCGCATCG TGCAGTGCCG 
CTCTGTGGAG 9420 

GGCAGCTGTG GCTTCTAGCT GCCCGGGTGG CATCCCTGTG ACCCCTCCCC 
AGTGCCTCTC 9480 

CTGGCCCTGG AAGTTGCCAC TCCAGTGCCC ACCAGCCTTG TCCTAATAAA 
ATTAAGTTGC 9540 

ATCATTTTGT CTGACTAGGT GTCCTTCTAT AATGACGCGT CGTGCCCACC 
TATGCTCGCC 9600 

ATGATGCTCA ACACTACGCT CTCTGCTTGC TTCCTGAGCC TGCTGGCCCT 
CACCTCTGCC 9660 

TGCTACTTCC AGAACTGCCC AAGAGGAGGC AAGAGGGCCA CATCCGACAT 
GGAGCTGAGA 9720 

CAGGTACCAC TGTGGTCCGT TCAGGGCTGC TGACAGTGCC GTAGGAAGGG 
TCATGGGCTA 9780 

GGAGAGAGGG AAACCTTGTC TGAGCAGTCA GACTTTAGGG GAGGTTCCTG 
GAAGGAAGCA 9840 

GTTATCTTAT ATGGAGTAGA TGGGTTTCCC AGAACGGTAA GAGGGGACCA 
GGTGCCAGAG 9900 
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AAGCCACATA AAGGACAGTG TCCCCAGGCA GGGGATATGC CAGAAAATGA 
GAGATACTTA 9960 

TCACTGGGCT TGGGATGAGA ACGGGTTAAA CTGGGTACCC TGGCCTCCTC 
TGCACAGCTG 10020 

GAGGTGGCCG GTGGTATGTT GGCTCACCAG GACTGGGTAG ATGGTACGAA 
ACTGTTCTCG 10080 

CCTGAGTACA AAGCCTTTCC CACCCAGCTC AAACTCTCTT AGCTCCTTTT 
TTAGCCAGCT 10140 

GCACCGGTTT CTTCCTGTCC ACGGAAGACG GCCATTGCCC TGTGTCTGAG 
CGGAGTATGT 10200 

CCCACATCTA GCCTCAGCCT CGTGCCCAGA TCTGCTGTAC TGTATGTTCA 
GCTCTGAGTC 10260 

TGCCCTTCCG GCAGGGCTGA AGGGAATCCA GTCACTAGGC TCAAATCTGG 
TCAGGTCACA 10320 

GGTGGCTCAG TTTTGAACAA GCTCGATGGG CAGTAGGCAG TTCACCGAGT 
CTGCCTTCCG 10380 

TTTGCTGAGT TCCTTTGGAG ACTTCCGAGG CACTAGGTGT GTCTTGCACC 
CATCAGCCTA 10440 

ATTCGGTCCT TGCCACCTTC CTACTAGGGC ATAATAGGTT GGCGGGAGGT 
AAAAGCCCAC 10500 

CAGCGTGGGG CAGGGGTAAG AGTGAGCGAG CCGTAGGTAC AGGAAAGAGG 
ATCTTGGAAT 10560 

GTGTAGGGCC ATCTGAATGT CGGAGAGGTA AGTCTCTGAG AGACTGCTGC 
ACACCGGTGA 10620 

CACATCAGAG CTGAGGAGGT CCCCCAAGTG TTGTCTCCCC CGCCCCCCGC 
CCCATACGAC 10680 

TCTGTCAAAG CAGGAGAGGG TTTTGAGACC TCATGAGAAC TGATCCTCCT 
GATAACCTAG 10740 

CCGGTTAGAT TTCCACTCTC GCCCTTTACG GCTGCTTCGT CCTAGATAGA 




38 



GCCAGAGCAT 10800 

CTGGCCGGTG AAGCTGGGAT AGCAGCAGGG TGACCTTAGG TTCCCAACGC 
CCCTCTTGGC 10860 

CTGGCTCCAG CTGACCCGCG TCCTTCCCCG CAGTGTCTCC CCTGCGGCCC 
TGGCGGCAAA 10920 

GGGCGCTGCT TCGGGCCGAG CATCTGCTGC GCGGACGAGC TGGGCTGCTT 
CCTGGGCACC 10980 

GCCGAGGCGC TGCGCTGCCA GGAGGAGAAC TACCTGCCCT CGCCCTGCCA 
GTCTGGCCAG 11040 

AAGCCTTGCG GAAGCGGAGG CCGCTGCGCT GCCGCGGGCA TCTGCTGCAG 
CGATGGTGCG 11100 

CACAAAGCCA GGCGGGCTGA GCATGGGGAA TGGATGGGGT GGGTGGGAGG 
TAAAGGGGGG 11160 

CTAAGTGGGG GACTGAGGAA TCAGGACCGG AGATGGAGGG TGAGTAGTAT 
GAAGGGGGTC 11220 

GAGAGTTGGA ACGTAGCAGG GTAGGATAAA GGGGATTGTG GGGATGGCGC 
CCCTATAGGT 11280 

GCGCCCACCC CAGGACGCCT GACCTCACAC AGCCCTTCCT TCAGAGAGCT 
GCGTGGCCGA 11340 

GCCCGAGTGT CGAGAGGGTT TTTTCCGCCT CACCCGCGCT CGGGAGCAGA 
GCAACGCCAC 11400 

GCAGCTGGAC GGGCCAGCCC GGGAGCTGCT GCTTAGGCTG GTACAGCTGG 
CTGGGACACA 11460 

AGAGTCCGTG GATTCTGCCA AGCCCCGGGT CTACTGAGCC ATCGCCCCCC 
ACGCCTCCCC 11520 

CCTACAGCAT GGAAAATAAA CTTTTAAAAA ATGCACCCTG GTGTCTGTCT 
CTCTTTCTGG 11580 

GGTGGGGAGA AAAGGGGGGA GAGGAATTGG AGTGGGAACT TTCTACTCTG 
CTCTGACTGA 11640 
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TCCCCACATC CAAAGTCGTG CATAAGATAC GCCCCCACCG CCAGAAGGGG 
CAGAACCTAT 11700 

AAGTCTTAGA GTATAAAGGA AGCTTCTGCT GCTCCTGGAT ACCCACATAA 
TACTCAGAAA 11760 

AAAAGGCAAG TCAGAAGAAG GGAAAGATCT GAGATCCAGA GGAGCCTGAA 
GGGTCAGGGT 11820 

GACTTAGCAA GTTTCTATCT GAGACCGAAA TAAAAGGACA TTGTGGACAA 
GAGAAACAGA 11880 

GCAGGACATG AGGAGAGACA GGATCAGCAA GAGTGACAGA GAAAGAGGGG 
ACAGGCCAGG 11940 

GGTGGCCATC TCAGCCCTGA TTTCACCCAG ACTAAGGCAA AAACAACGTG 
AAGGACTCTT 12000 

AACCAAGGCT GTGCTTGGAT GGGAGGAGAA GGTACAGAGA CATTACCCCA 
GACCTAAAGA 12060 

AGACAATGCC ACCCGCCTTC TCTCCAGGTG CTCCACCATC AAGACCCAGC 
CACTGAGAGG 12120 

CAGACTCCAG TAAGAGTCCA GCTACAAGTC CTCTACAGGC ACATGTTCAA 
ACCGTCACAC 12180 

CCACACTCAG GCAGGGAAAT AGACAAGATA GGCTGGAGTT GTGGCTCAGG 
AGCAGAAGTC 12240 

TTCCCTAGCT ATGGTTCCAG CACCATGGAG ACGGAAGGGG GACTGAGCGG 
GGGGGGGGGC 12300 

GGGGGGGAGG AAAAAGTAGC AGCTACTAGG GGCATTTCTA TGACCCTTGT 
CCTCAACCAT 12360 

AGCTAGAGAC CCAGAGGAAC ACAGAAGTCC AGCAGCAAGG CGCACATGCT 
TGCAATAGCT 12420 

CCCAGACGTA AATACTTCAT TCCGTTCGGC ACATCCGGGT CATCAGCACT 
TGACTCCCCC 12480 

CCCCACACTT CTTATTACCT CCTCTTTTTT CTAAAATTTT AGATTTATTC 
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ACTTATGTAG 12540 

ATGGGTGTTT TTGGTTTTGT ATGAATGTCT GAGCACCATG TGTGTGCCTG 
GCGCCTCAGA 12600 

GGTCAGGAGA GGGCATCGGG TCCCCTGGAA CTAGTTACAG GTGGTTACAG 
CCTACCGTGT 12660 

GGGCACTAGG AACTGAATCC CAGTCCTCTT AACTGACCCG CACATCCAAA 
CCCAGGCTTC 12720 

AGCCCCTCAT CAGCCTGTCC CTCCTCCAGG CCCTCAGGTG TCTCCCGTCT 
CCGGCTGCTC 12780 

TCCCAGACAT CCTTCCATCC TCTGGTCTCC CTGCTCCTCG CCCTCCTGTT 
AACATCCTTT 12840 

CTCTCTGCCC CATCTGTCCT GGGCATCCTC TCCTGCGAGC TGCAGCAAGG 
TCAGGATGGT 12900 

TTACCTCATT TGGGATGGCC TGCAGGTTCT GAGGTCAGGG GCAACTACAG 
AGAAGAGAGA 12960 

GATTAGTCTG ATTGACTTAA GGTGGTTCAG CAAGGTCAGC TCTGCCCAGA 
CTCACGGTCT 13020 

TTTACCCAGA TGCCAGCTCT CTTCCCATCT CCTCGGTGCC TATACACCTC 
TCTGCATGCC 13080 

CCGGTTTAGA CAGGTAGCAC AGGGGCCAGG CAGACTCCTA TCCCAGCCTC 
CTCCTTCTGT 13140 

GGCCCTCTTA GGGTCTGACC TCCAATAGGG CAGGGCCAGG GAAGGGCCAG 
ACCAAAAAGG 13200 

GACAGAAAGA AGCGTGGCAG GCGGCATGGG CACACTTGAT TCAACCCCTA 
CGGCTGGTGT 13260 

ATGGGCAGCT TTAGAATGAA GGTCAGATTC TCACTTCGAG CCTCTGCGCA 
GGTGGAGTGT 13320 

TGTAAGCGTC TCGCTTTCCT CCACCTGTTT CTGGAAGAAT CAGGCTCCTC 
TTCCTCGAGG 13380 
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AGAGAATTAT ACCTGCTCAC CCTACTTCTG CCTACTGGAC ATAATATATA 
TTTTTTTCCT 13440 

TTCAGGGAGT CCTTTCCTCA GCTACAGAGC CATTTAAGGG CACTCCCAGA 
GTTCACAGCA 13500 

GATGCTTGCC TCCTCTCTTC AGCCTCCAGA AGCAGAGAGG CTTGTGAGCA 
AATGCCAGGA 13560 

CCTCTGACCT CCACACAGAC GCTGTGCTGT GTGCACAGCC CTCAAGCACA 
CAGCGAAGCA 13620 

ATAGTGAAAA GTAACTTAGA CCATTTTCAG GCTGGGGAGA TGGCTC AGGA 
GATAAGAGAT 13680 

CACTGCTCAA CTTGAGCCTC GGGACCACAG GTAAGACCAA CTTGTCTGCT 
GCAAGAGAGC 13740 

TGCCTGGTGA GATTGGGACA CACAGAGGCA GAGTTCATCT AGGACCGGGC 
ACGTCCTGTG 13800 

TTTGCCGAGG TCCCACACCC GCGGATCCCG GCCCGCAGCA GCTCTCTGCT 
CCCAGAACCC 13860 

GTGAGAAAGA GACCTCACCG CCTGGTCAGG TGGGCACTCC TGAGGCTGCA 
GAGCGGAAGA 13920 

GACCACCAAC ACTGCCCACC CCTGCCCACA TCCCTGGCCC AAGAGGAAAC 
TGTATAAGGC 13980 

CTCTGGGTTC CGTGGGGGAG GGCCCAGGAG CGTCAGGACC CCTGCCTGAG 
ACACCGCCGG 14040 

AACCTGAGGG AAACAGACCG GATAAACAGT TCTCTGCACC CAAATCCCAT 
GGGAGGGAGA 14100 

GCTGAACCTT CAGAGAGGCA CACAAGCCTT GGAAACCAGA AGAGACTGCT 
CTCTGTACAT 14160 

ACATCTCGGA CGCCAGAGGA AAACACC AAA GGCCATCTGG AACCCTGGTG 
CACTGAAGCT 14220 

CCTGGAAGGG GCGGCACAGG TCTTCCTGGT TGCTGCCGCC ACAGAGAGCC 
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CTTGGGCAGC 14280 

ACCCCGCCTG GTGAACTCAA GACACAGGCC CACAGGAACA GCTGAAGACC 
TGCAGAGAGG 14340 

AAAAACTACA CGCCCGAAAG CAGAACACTC TGTCCCCATA ACGGACTGAA 
AGAGAGGAAA 14400 

ACAGGTCTAC AGCACTCCTG ACACACAGGC TTATAGGACA GTCTAACCAC 
TGTCAGAAAT 14460 

AGCAGAACAA AGTAACACTA GAGATAATCT GATGGTGAGA GGCAAGCGCA 
GGAACCCAAG 14520 

CAACAGAAAC CAAGACTACA TGGCATCATC GGAGCCCAAT TCTCCCACCA 
AAACAAACAT 14580 

GGAATATCCA AACACACCAG AAAAGCAAGA TCTAGTTTCA AAATCATATT 
TGATCATGAT 14640 

GCTGCAGGAC TTCAAGAAAG ACGTGAAGAA CTCCCTTAGA GAACAAGTAG 
AAGCCTACAG 14700 

AGAGGAATCG CAAAAATCCC TGAAAGAATT CCAGGAAAAC ACAATCAAAC 
AGTTGAAGGA 14760 

ATTAAAAATG GAAATAGAAG CAATCAAGAA AGAACACATG GAAACAACCC 
TGGATATAGA 14820 

AAACCAAAGG AAGAGACAAG GAGCCGTAGA TACAAGCATC ACCAACAGAA 
TACAAGAGAT 14880 

GGAAGAGAGA ATCTCAAGAG CAGAAGATTC CATAGAAATC ATTGACTCAA 
CTGTCAAAGA 14940 

TAATGTAAAG CGGAAAAAGC TACTGGTCCA AAACATACAG GAAATCCAGG 
ACTCAATGAG 15000 

AAGATCAAAC CTAAGGATAA TAGGTATAGA AGAGAGTGAA GACTCCCAGC 
TCAAAGGACC 15060 

AGTAAATGTC TTCAAC AAAA TCATAGAAGA AAACTTCCCT AACCTAAAAA 
AAGAGATACC 15120 
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CATAGGCATA CAAGAAGCCT ACAGAACTCC AAATAGATTG GACCAGAAAA 
GAAACACCTC 15180 

CCGTCACATA ATAGTCAAAA CACCAAACGC ACAAAATAAA GAAAGAATAT 
TAAAAGCAGT 15240 

AAGGGAAAAA GGTCAAGTAA CATATAAAGG CAGACCTATC AGAATCACAC 
CAGACTTCTC 15300 

GCCAGAAACT ATGAAGGCCA GAAGATCCTG GACAGATGTC ATACAGACCC 
TAAGAGAACA 15360 

CAAATGCCAG CCCAGGTTAC TGTATCCTGC AAAACTCTCA ATTAACATAG 
ATGGAGAAAC 15420 

CAAGATATTC CATGACAAAA CCAAATTTAC ACAATATCTT TCTAC AAATC 
CAGCACTACA 15480 

AAGGATAATA AATGGTAAAG CCCAACATAA GGAGGCAAGC TATACCCTAG 
AAGAAGCAAG 15540 

AAACTAATCG TCTTGGCAAC AAAACAAAGC GAATGAAAGC ACACAAACAT 
AACCTCACAT 15600 

CCAAATATGA ATATAACGGG AAGCAATAAT CACTATTCCT TAATATCTCT 
CAACATAAAT 15660 

GGCCTTAACT CCCCAATAAA AAGACATAGA TTAACAAACT GGATACGCAA 
CGAGGACCCT 15720 

GCATTCTGCT GCCTACAGGA AACACACCTC AGAGACAAAG ACAGACATTA 
CCTCAGAGTG 15780 

AAAGGCTGGA AAACAATTTT CCAAGCAAAT GGTCAGAAGA AGCAAGCTGG 
AGTAGCCATT 15840 

CTAATATCAA ATAAAATCAA TTTTTAACTA AAAGTCATCA AAAAAGATAA 
GGAAGGACAC 15900 

TTCATATTCA TCAAAGGAAA AATCCACCAA GATGAACTCT CAATCCTAAA 
TATCTATGCC 15960 

CCAAATACAA GGGCACCTAC ATATGTAAAA GAAACCTTAC TAAAGCTCAA 
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AACACACATT 16020 

GCACCTCACA CAATAATAGT GGGAGATTTC AACACCCCAC TCTCATCAAT 
GGACAGATCA 16080 

TGGAAACAGA AATTAAACAG AGATGTAGAC AGACTAAGAG AAGTCATGAG 
CCAAAGGGAC 16140 

TTAACGGATA TTTATAGAAC ATTCTATCCT AAAGCAAAAG GATATACCTT 
CTTCTCAGCT 16200 

CCTCATGGTA CTTTCTCCAA AATTGACCAT ATAATTGGTC AAAAAACGGG 
CCTCAACAGG 16260 

TACAGAAAGA TAGAAATAAT CCCATGCATG CTATCGGACC ACCACGGCCT 
AAAACTGGTC 16320 

TTCAATAACA ATCAAGGAAG AATGCCCATA TATACTTGGA AACTGAACAA 
TGCTCTACTC 16380 

AATGATAACC TGGTCAAGGA AGAAATAAAG AAAGAAATTA AAAACTTTTT 
AGAATTTAAT 16440 

GAAAATGAAG GTACAACATA CCCAAACTTA TGGGACACAA TGAAAGCTGT 
GCTAAGAGGA 16500 

AAACTCATAG CGCTGAGTGC CTGCAGAAAG AAACAGGAAA GAGCATATGT 
CAGCAGCTTG 16560 

ACAGCACACC TAAAAGCTCT AGAACAAAAA GAAGCAAATA CACCCAGGAG 
GAGTAGAAGG 16620 

CAGGAAATAA TCAAACTCAG AGCTGAAATC AACCAAGTAG AAACAAAAGG 
ACCATAGAAA 16680 

GAATCAACAG AACCAAAAGT TGGTTCTTTG AGAAAATCAA CAAGATAGAT 
AAACCCTTAG 16740 

CCAGACTAAT GAGAGGACAC AGAGAGTGTG TCCAAATTAA CAAAATCAGA 
AATGAAAAGG 16800 

GAGACATAAC AACAGATTCA GAGGAAATTC AAAAAATCAT CAGATCTTAC 
TATAAAAACC 16860 
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TATATTCAAC AAAACTTGAA AATCTTCAGG AAATGGACAA TTTTCTAGAC 
AGATACCAGG 16920 

TACCGAAGTT AAATCAGGAA CAGATAAACC AGTTAAAC AA CCCCATAACT 
CCTAAGGAAA 16980 

TAGAAGCAGT CATTAAAGGT CTCCCAACCA AAAAGAGCCC AGGTCCAGAC 
GGGTTTAGTG 17040 

CAGAATTCTA TCAAACCTTC ATAGAAGACC TCATACCAAT ATTATCCAAA 
CTATTCCACA 17100 

AAATTGAAAC AGATGGATCA CTACCGAATA CCTTCTACGA AGCCACAATT 
ACTCTTATAC 17160 

CTAAAAAACA CAAAGACACA ACAAAGAAAG AGAACTTCAG ACC AATTTCC 
CTTATGAATA 17220 

TCGACGCAAA AATACTCAAC AAAATTCTGG CAAACCGAAT CCAAGAGCAC 
ATCAAAACAA 17280 

TCATCCACCA TGACCAAGTA GGCTTCATCC CAGGCATGCA GGGATGGTTT 
AATATACGGA 17340 

AAACCATCAA CGTGATCCAT TATATAAACA AACTGAAAGA ACAAAACCAC 
ATGATCATTT 17400 

CATTAGACGC TGAGAAAGCA TTTGACAAAA TTCAACACCC CTTCATGATA 
AAAGTCCTGG 17460 

AAAGAATTGG AATTCAAGGC CCATACCTGA ACATAGTAAA AGCCATATAC 
AGCAAACCAG 17520 

TTGCTAACAT TAAACTAAAT GGAGAGAAAC TTGAAGCAAT CCCACTAAAA 
TCAGGGACTA 17580 

GACAAGGCTG CCCACTCTCT CCCTACTTAT TCAATATAGT TCTTGAAGTT 
CTGGCCAGAG 17640 

CAATCAGACA ACAAAAGGAG GTCAAGGGGA TACAGATCGG AAAAGAAGAA 
GTCAAAATAT 17700 

CACTATTTGC AGATGATATG ATAGTATATT TAAGTGATCC CAAACATTCC 
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ACCATGGCTT ACGCTCTAAG ATCAAGAATC GACAAATGGG ATCTCATAAA 
ACTGCAAAGC 18660 

AACTGTAAGG CAAAGGACAC TGTGGTTAGG ACAAAACGGC AACCAACAGA 
TTGGGAAAAT 18720 

ATCTTTACCA ATCCTACAAC AGATAGAGGC CTTATATCCA AAATATACAA 
AGAACTCAAG 18780 

AAGTTAGACC GCAGGGAAAC AAATAACCCT ATTAAAAAAT GGGGTTCAGA 
GCTAAACAAA 18840 

GAATTCACAG CTGAGGAATG CCAAATGGCT GAGAAACACC TAAAGAAATG 
TTCAACATCT 18900 

TTAGTCATAA GGGAAATGCA AATCAAAACA ACCGTGAGAT TTCACCTCAC 
ACCAGTGAGA 18960 

ATGGCTATGA TCAAAAACTC AGGGGACAAC AGATGCTGGC GAGGATGTGG 
AGAAAGAGGA 19020 

ACACTCCTCC ATTGTTGGTG GGATTGCAAA CTGGTACAAC CATTCTGGAA 
ATCAGTCTGG 19080 

AGGTTCCTCA GAAAATTGGA CATTGAACTG CCTGAGGATC CAGCTATACC 
TCTCTTGGGC 19140 

ATATACCCAA AAGATGCCCC AACATATAAA AAAGACACGT GCTCCACTAT 
GTTCATTGCA 19200 

GCCTTATTTA TAATAGCCAG AAGCTGGAAA GAACCCAGAT GCCCTTCAAC 
AGAGGAATGG 19260 

ATACAGAAAA TGTGGTACAT GTACACAATG GAATATTACT CAGCTATCAA 
AAACAACGAG 19320 

TTTATGAAAT TCGTAGGCAA ATGGTTGGAA CTGGAAAATA TCATCCTGAG 
TAAGCTAACC 19380 

CAATCACAGA AAGACATACA TGGTATGCAC TCATTGATAA GTGGCTATTA 
GCCCAAATGC 19440 

TTGAATTACC CTAGATACCT AGAACAAATG AAACTCAAGA CGGATGATCA 
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ACCAGAGAAC 17760 

TACTAAAGCT GATAGACAAC TTCAGCAAAG TGGCTAGGTA TAAAATTAAC 
TCAAATAAAT 17820 

CAGTTGCCTT CCTCTATACA AAAGAGAAAC AAGCCGAGAA AGAAATTAGG 
GAAACGACAC 17880 

CCTTCATAAT AGACCCAAAT AATATAAAGT ACCTCGGTGT GACTTTAACA 
AAGCAAGTAA 17940 

AAGATCTGTA CAATAAGAAC TTCAAGACAC TGAAGAAGGA AATTGAAGAA 
GACCTCAGAA 18000 

GATGGAAAGA TCTCCCGTGC TCATGGATTG GCAGGATTAA TATAGTAAAA 
ATGGCCATTT 18060 

TACCAAAAGC AATCTACAGA TTCAATGCAA TCCCCATCAA AATACCAATC 
CAATTCTTCA 18120 

AAGAGTTAGA CAGAACAATT TGCAAATTCA TCTGGAATAA CAAAAAACCC 
AGGATAGCTA 18180 

AAGCTATCCT CAACAATAAA AGGACTTCAG GGGGAATCAC TATCCCTGAA 
CTCAAGCATG 18240 

ATTACAGAGC AATAGTGATA AAAACTGCAT GGTATTGGTA CAGAGACAGA 
CAGATAGACC 18300 

AATGGAATAG AATTGAAGAC CCAGAAATGA ACCCACACAC CTATGGTCAC 
TTGATTTTTG 18360 

ACAAAGGAGC CAAAACCATC AAATGGAAAA AAGATAGCAT TTTC AGCAAA 
TGGTGCTAGT 18420 

TCAACTGGAG GTCAACATGT AGAAGAATGA AGATCGATCC ATGCTTGTCA 
CCCTGTACAA 18480 

GCTTAAGTCC AAGTGGATCA AGGACCTCCA CATCAAACCA GACACACTCA 
AACTAATAGA 18540 

AGAAAAACTA GGGAAGCATC TGGAACACAT GGGCACTGGA AAAAATTTCC 
TAAACAAAAC 18600 
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AAATGTGAAT 19500 

GCTTCACTCC TTCTTTAAAA GGGGAACAAG AATACCCTTC GCAGGGAAGA 
GAGAGGCAAA 19560 

GATTAAAACA GAGAATGAAG GAACACCCAT TCAGAGCCTG CCCCACATGT 
GGCCCATACA 19620 

TATACAGCCA CCCAATTAGA CAAGATGGAT GAAGCAAAGA AGTGCAGACC 
GACAGGAGCC 19680 

GGATGTAGAT CGCTCCTGAG AGACACAGCC AGAATACAGC AAATACAGAG 
GCGAATGCCA 19740 

GCAGCAAACC ACTGAACTGA GAATAGGACC CCCGTTGAAG GAATCAGAGA 
AAGAACTGGA 19800 

AGATCTTGAA GGGGCTCGAG ACCCCATATG TACAACAATG CTAAGCAACC 
AGAGCTTCCA 19860 

GGGACTAAGC CACTACCTAA AGACTATACA TGGACTGACC CTGGACTCTG 
ACCTCATAGG 19920 

TAGCAATGAA TATCCTAGTA AGAGCACCAG TGGAAGGAGA AGCCCTGGGT 
CCTGCTAAGA 19980 

CTGAACCCCC AGTGAACTAG ACTGGTGGGG GGAGGGCGGC AATGGGGGGA 
GGGTTGGGAG 20040 

GGGAACACCA TAAGGAAGGG GAGGGGGGAG GGGGATGTTT GCCCGGATAC 
CGAAAGGGAA 20100 

TAACATCGAA ATGTATATAA GAATACTCAA GTTAATAAAA AAAAAAAAAA 
AAAAAGAGAT 20160 

CACTGCTCTT GCAGAGGCCC CCAGTTCTGT TCCCCACAAC CTCTTAGGAT 
GACTCACAAC 20220 

CACCTGTAAC CCCATTTCAG GGGATCTGAT GCCCTCTTCT GGTCCCCATG 
GGCACTGCAC 20280 

TCATTTACAA ATACTTTCAC ACAGAGACAC ATGCACAGAA ATGGAAATTT 
AACAGAATAA 20340 



49 



ACATCGGAAC ATTAAAAACA AAACAAAACA AAACAAAAAA CAAAAAAAAC 
CCCATAGGAC 20400 

TGGAGAGATG ACTCAGTGGT TAAGAGCACT GACTGCTCTT CCAGAGGTCC 
TGAGTTTAAA 20460 

TCCCAGCAAC TACATGGTGG CTCACAACCA TCTGTAATGG TCTCTTCTGG 
TGTGCCTGAA 20520 

GACAGTGACA GTGTACCCAC ATACATGAAA TAAATAAATC TTTAAAAAAA 
AAAAGCCCAG 20580 

AAAGTGATGA ACTCTATTAC CACCAAAAAG AAAAAAAAGA AAAAGAAAAA 
CTCAAATCAA 20640 

TCTTGAAGTC TCTTTCGCAT ATCTCTTTGG CCTCCACCCT GTCTGTGGAT 
CCCACATGTG 20700 

GGGGCGGTGG GGCATCTGTG TTCATTTGCT GAGTGTGAGA GCCACATAAA 
GTGCTGGTTT 20760 

ACGTGTTTAC TTGTTTTCTA GATGGGATGG AGCCCAGGAC CTTAACCTTG 
TGGGGCAAGA 20820 

CTTGCTCTCC TGAGCTCTAC CCAAGCAGTC TGGATTGCGG GTTTCCTGTT 
TGTCTGTGAG 20880 

CTCTCTGCTT TGTGGTCATT TGTGCCCACT GGCTCTTAGA TCCCATGACT 
TCCCAGAGAA 20940 

CGCTGTCCTG CAGAGGCAGA CACAGGGCCC CTGAGCTCAG GCCCGGCCCT 
GGAGACAGAA 21000 

ACGGAGAGGC CAGTTGATTC TTGATATTTT CCGTTGTGGC TTCCTTGGGG 
CCTGTGTGAG 21060 

AATTCAGCTC TGTAGAAACC TTATGGTTCT GCAACTACCC TCCCCTGGCC 
AAGACCCTTT 21120 

CTTCTAGCCC GGGATTACCC CCTCAACCTC TGAGGTCGCC GCCAAGGTCT 
CCTTCCAGAT 21180 

ATGGAAGTGA CTGGATGAGT CCCTTGCGGC CTCGCCTGCC TTCCCATCAC 
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CCAGGCCCCT 21240 

GTTTGGCTTT GCCCTTTCCC ACAGAAGTCC ACCATTGCTG TTTGGACTTC 
CAAATGGTGC 21300 

TCCTAAGTCT GTCTGCCGCA GGCCTTACCC CAGTCGGGAG TGGGAAACGG 
GCCTAACTGG 21360 

AGATGACAAC CTGTAGAGAC CCCTCGGTCC TCCTAGCAGC CTGCTGGGCT 
GTTCTCCCTC 21420 

TGAATTGCCA ATGTCCATGG CGTTCCCGGT GCCTTTCCTC CCTCCCGTTT 
CTGACAATTA 21480 

GACGCCAGTC AAGTTTGAAA AGGAAATCTG CTTTATTTAT TTATTTATGT 
TTTTTTTTTA 21540 

ATTTTTTTCT GGTAGTGGCC ATGGGGAACG AAGGAAGCGC CCTAAAGGTA 
TCATCACAAA 21600 

GCAGGGCTCA GCGGCCGGTC TCAGTGCTGG GAGAAGGCGG CCTCAGGGTC 
GCAGGCGGGG 21660 

TCCTCGTGGC AGCCGTCTGC ACAGAGAGAC GCCGAGTCAG GGCCGCCCTG 
GCCCAGCCCG 21720 

CCTGGTCCTG GAGCCCCGGT CCCGTCTCGA CCCCTGCCCG ACTCACCCGG 
GCTGCAGCAG 21780 

ATGCCGGCGG CGGCGCAGCG GCCCCCGCTC CCGCAGGGCT TCTGGCCGGA 
CTGGCAGGGC 21840 

GACGGCAGGT AGTTCTCCTC TTGGCAGCGC AGCGCCTCGG CCGTGCCCAC 
GAAGCAGCCC 21900 

AGCTCGTCCC CGCAGCAGAT GCTGGGCCCG AAGCAGCGGC CTTTGCCCCC 
GGGGCCGCAG 21960 

GGGAGACACT GGTGGGAGGG AAGGGATGAG CCGGGGGCGG GAGGGGAGCG 
GCCGGGGAGG 22020 

GAGACCCTGT GGGGCGGGGG GCTGAGCCGG GCGGGCGAGG GCGGCCGGAG 
GAGCGCGGGA 22080 
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GGTGGCGGGT CTCCCTGGCT CTCTCTTTGG GCTCAAAAGC GGTCGAAGGA 
GGGCAGTCAA 22140 

AAGCTCCTCC GCTCCCTCGA TTCCCAGGCT AGGTGGGGCC GGTACGCGGT 
CAGCGCGGGA 22200 

AAGGGGGCGG CGGGGGCGAC CCTGTGGCAG CGGGCCGGGC AGCCCGGAGA 
GCCACGGGTC 22260 

GAGGGCGGGG CTCTCACCGT GCGCACGTCG AGGTCCAGCA CCGCGCGTTT 
GCCGCCCAGG 22320 

GGGCAGTTCT GAATGTAGCA GGCGGAGGTC AACGCCAGGA GGCCGAGCAG 
GCAGCAGGCG 22380 

AGGCTGGAAC CTGCCATGGC GTTGGTGTTC AGTCCGAGAT CGGTCGACCG 
ATCCACCGTC 22440 

GGTGATGGTT TCTCCAGCCC AGACCGACCT TTTTATGCCT TGTCCACTGC 
CATGGTGGGG 22500 

CCCAGTCTAA GAGGGTGACT GCATGACTGG TCACAGCCAG GTCTCTTGGG 
TCAAACTGTT 22560 

CCACACTGTT TAGAAGCAGG CCCTTCATTT GCAGGGTCTG GGCTGGGGTC 
AAGGTCACCG 22620 

CCTCAGCTAA TGACCTGAGC TCAAAAGGGA CACAGCCTAG AAGGGGAGGC 
CTAAGCTACA 22680 

AGAGGATAAA GAGACTTGGA GGGGGTAGAG GTGCAGCCTA GCCAAGAGCT 
GTTTTTTCAT 22740 

AGAAATCCAA TACCTCAGAA TGAGGTTGGA TAGCGCAAGT GGGTGAGGAA 
GCCCTTACGT 22800 

GGATCTAAAG CTTAGATGGG GAAAAGGATC TTGTTCAATC TCTGAGTGCA 
GCTCAGCCCT 22860 

TCTTCTAACT AGCCCGTAAA ACAAAATATC AGTAGAAATC AAACCCAAAA 
ACACAACAAA 22920 

CAGACC AAAA TAAAGTAAAA AGAAAGAAAA ATCAC AATAA AAGGAAAAAT 
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CACACTTGCA 22980 

CTTACAACTC TGTATTAGGG CTGGAGAGAT GGCTCAGTGG TTAGGAGCAC 
TGACTGCTCT 23040 

ACCAAAGGCC CTGAGTTCAA ATCCCAGCAA CCATATGGTG GCTCACAACC 
ATCTGTAATG 23100 

GGATCTGATG CCCTCTTCTG GTGTGTCTGA AGAGAGCTAC AATGTGCTTA 
TATATAATAA 23160 

ATAACTTTAA AACTCTTTAA AACTCTGTAT TAGAACTTGC TATGAGGACC 
AGGCTGGCCT 23220 

TGAACTCACA GTGATCTATT TGCTTCTGCC TCCCAAATGC TAGGTACCTA 
CACTCCCGTT 23280 

TGAGAAAACA CAGGCCATCA GCTGCTTGAG CGTGGCCAAC AGGCGGCCTC 
AGCTACAGAG 23340 

AGCCATTTGT CCTAAGGCCA TACCCTTCCT GGTGGCCACA TGTAATGGTG 
GCCCATTTTA 23400 

GTACATACAA CTAGGCATCT CGTGTTGCAT TTCAGGGTTG GGCTGCAGGC 
CTGCATAGGT 23460 

CTGCATGGGA AGAATGCTAC ATGCAGCTCA GTAGCAGACT GCCTGCCTAG 
TGTGTGAGAG 23520 

ACCTTGGGTC CAGTCCCCAG CATGGTGGTA GCAAGACATT TTGGGAACAG 
TTTTTGCTTT 23580 

AAATTTTAAC TTTTATTTGT GTGTATGTTT GTGTACACAG GTTTCCTTGG 
CAACCAGAAG 23640 

AGGATGTTGT ATCCCCAGGA CGTGTCATTA AAGGGGATCA TGAGTAGGCC 
TATGTGGGTG 23700 

CTGGGAACAA AATTCAGAAT TCTGCAAGAG CAGTGTGCAG ACTTAACCAT 
TAAGCCATCT 23760 i 

CCCCAGCTCC TTGATTTTGC ATTTGAATAC AGTTTAAAAT GAAATGCACA 
TTAAGCCACG 23820 
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GTGACAGTGA TGCAAACTTT TAATCCCAGA ACTCAGGAGG CAGAGGCAGG 
AGAATGTCTG 23880 

TGAGTTCCAG GCCAGCCTGG TCTACAGACC TAGTTCCAGG CCAGCCTGGG 
CTACACAAAA 23940 

AAACAAAAGC AAAACCAAAA CAAAATAAAG ACACAGACAA ACCATGGCAG 
GAAGACATGG 24000 

GAGCCTCAAC CTCTTCATTT GACGGCTGAG AAATCGAAAA CAGATGACCA 
GGAGAGACCA 24060 

AGGTCTCACT GCTGCCTTCA AGGCTTGCCC TCAGTGACTG GAAGATGTTC 
CACTGGGCCG 24120 

CATCTTAATA TCTTACCATC TCAGGGCTGG AGAACTGGCT GAGTGGTTGA 
GTTGCTCTTG 24180 

CAGAAGTCCT AGGTTTGATT CCGAGGACCC ACAGGGTGGC TGAGATCACT 
TATCCCAGTT 24240 

CCAGTGGAGC CAGTACCCAA ATAGTGCATT ACACACTTGC AGGCAGAACG 
TTCAGACACA 24300 

TAAAATAAAA TAAATAGACC TAAAAACATT TAAAAGAAAG GAGAAGCATT 
ATCCAGAGTC 24360 

GTTTTATTTT GTTTTGAGAT AGACTCTTAG TTGACCTGGG ACTGTCTGTG 
TAGACTAGGC 24420 

TGGGCTTGAA CTCACAGCGA TCCCCCTGCC TCCCAAAGTG CTGGGTGTAC 
CACCGTGCCA 24480 

GGTACCTAGG CCCCTGTTTA AGAAGACACT TGCCATCAGT GGCTGGGTGT 
GGTCTTAGCT 24540 

GCAGAAAGCC ACCTGGCCCT TCCCAGGTGT CCACATATAA TGGTTGGTCC 
ACTTTGGTAC 24600 

GAATGCTGGG CACCCCAACT GCATGTCAGC TTTGGGCTTT GGGTTAGCTG 
AGGTCTGCAT 24660 

ACTGGTTCTA GTTGCCCACC CCTTCTCTTC CATAGAGGTG GGGCCTAAGC 
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CCGTGTTCTA 24720 

AACTCCATCT CAGGCTCTCT TAAGAAGTGA CCTGCGACAT CCAGGAAGAA 
GTAACAGCCA 24780 

GTGCCCCCGA GACCCACTCA CTACATGCAG TCTCAGCCCC TAGAGAGGAT 
GGAAAAGCCT 24840 

CCGGTCTCCT TGTTCTTATG ATCAGCCTTC TCCTCAAGGA GCTGGGGCCA 
GTGGGGCAAA 24900 

GCACATTCTC TTCTGACCCT GAATCACAGA TCCTGAGTCA CTGGTGCAAA 
CTATCAAGCG 24960 

CTAAGTTGGT GGTGAGGTTG ACCTGTACTA CAAATCACTT CATTTCTCAC 
CCAGACTAGC 25020 

TTATTGGCAT TCCAGGCATA GAAAGCCAAG AGCTTGACCC CCACTATAGC 
CCCAGAGAGA 25080 

CAGCCCACAT AGTCTGTGGG CATAGTGATC TCATCTTAGG TAATCCATGC 
ACATAAATTA 25140 

GCATGTCTTG ATAATACATA CCTAATGCTC CTGTTAGGCC AGCATGCCTA 
ACATGCTCAC 25200 

CAACCCAATC TGTGTTTGGG AAAGGCCAAT ATTCCGCAAG GCAGAATGCT 
AGTCCTTCAG 25260 

GAATGGGGCT GCAGCTGGAC TGGGGAGAAC ACACTGAGGT TATAAGAGGA 
CCATTGAGGC 25320 

CTAATAGCCA AGGTAGAGT A GGCGGAGCCT TGGGTTACAG TGTTCAGCAC 
CAGGAGGAAA 25380 

GAGTCACTAT CACCATGGGG TTCATCTGTC ACTGGAGGAA GCAGAATATG 
AACTAAGAGG 25440 

CATATTATGT TGGGTTACGA CTTTAGTTAA GATCTGAGTG TATCCCATGT 
GATACATTGT 25500 

CAGTCCTTAG GAAGATGTCT TGGGAGATGG TGAGATCTTT AAGATAGAGA 
CCCAGTGTCA 25560 



55 



GGTTCTTTAT GTCTCTGACA GCATGCCCAT GAAGGAAGTG GTCTCTCCTG 
GATCTCTTTT 25620 

TCAGTTTTGC AGGCATGGGA TGAAGGGGTG TATCCTTCCG TGTGCTTCTG 
CCATGATGTG 25680 

TACTTCAACA TAGACCTGTA AGGAACAGTG GCTACAGATT GTGGACTGAA 
GTCTCTGAGA 25740 

CTGTGAGTCC AAATAACCCT TTCTTTCTAG GCATGGCGGC ACACACCTGT 
AATCTCAGCA 25800 

TGCTGGAAAT GTGCAGCAGG ATCAGGAGTT AAAGACCAGT CTCAGATAAA 
TGACAGTTCA 25860 

AAGCCATCAA GGGGATAATG AGATACTTCC TCAAAAACCA TCAAATTAAA 
ACTTTTGTTT 25920 

TTATACATTA CAACTTGTCA GGGGTTTTGC TATAGTAATT AAAAGTCACC 
ACAGGAAACA 25980 

AAGGCACGTA AACATAGCAA CATGTGCTAT GTTTAAGGCA ACATGTGCTA 
GGAAGGTAGA 26040 

TATCACCATG CTGGGTGCTT AGACCAGGGC TATGTCGAGG TCCCGGAGGA 
GAGCTGAGGA 26100 

AGCCCTGGGT GAATGTATAA TGTATCACGG GCCTCAGACC TGTGAGATCT 
GGCAAAGCTT 26160 

CCCCCTGCAC GCTGTGGGTG AGGTGAATGG GGATTCGGCA GAGCCTTTGT 
CTGGTCTGAG 26220 

TGCAAATGCT GACGGTATGT TCTAGTGGAG GTGTTTACAA AGGACGGGCC 
AGTGTGCGCT 26280 

TTAGCCATAG AAGTGGTGGC TCCCTGATGA ATGTCCACAA CCTGGGATTG 
CTGCCCACAA 26340 

GATCAGCCAG GCCCTCTCCT GCGCTGTGCA GAGTGAACAC ACGGAGGTTC 
TGGGCTGCTC 26400 

CAGTGGCTGC TACCATTCTG CCAGAGAGTG CACAGGCCAC CTGACCCCAG 
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CCTTTCTGTC 26460 

CATGTGTCTG TCCTTTCTTC ACTCTCTCAC CACCCTTGTT AGGGTCCCAG 
ATCCAAGTTA 26520 

TGTAGGGGGT GGATTAGGAA ATGCTATGGG ATGAGAGGCA GTGTTGGTTG 
TCATTCTCCT 26580 

TAGGGTAACC TGTGAGTATC AAGGAAAGAA AGTGTACACG CAGAAGGCTC 
ACCGTGCTGC 26640 

TGCTATGTAC AAGTGAGCAC AAATGTAACC TCTGGAAATA CCCATTTATC 
ATGTCTGTTT 26700 

TGGGGGCAGA GCCCAGGCAG GCGTTTCTAC TCATGGTCCT AGGAGCAGCC 
TCTCCTCATC 26760 

TGGTATGCAG CCCTTCCTTA TCCGAGACGG AGCCTGGTGC CGGGACACAG 
GTCATTTCCC 26820 

TGCAGTTGTA TATTATTTGG GCAGCTCACT TCTTTAAAAT ATTTTTGAAA 
AAATTATGTG 26880 

TATGAGCCTG CATGTATGTC TGTGCAAAAT GTCCACAGAG GCCAGAAGAA 
GGTGTCAGAC 26940 

CCCCTGGAAC TGGGAGTTCC GGGTGGTCGT GTTTGGCATA TGGGGCCTGG 
AAAATGAACC 27000 

CTGGTCTCCT AGAAGAGCAA CCAGTGCGCT CAGCTGCTGA GCACCTCTCC 
AACTCCTGCT 27060 

TCTCTGGACT GGGAGACAAA GGAAAAGTGA GAGACTGATT CTGTTCTGTC 
AAGTCTCTGA 27120 

GCATAGGGAA GACCTAGGTT CATTCTATGT CATCTGTCTG TCTGTCTGTC 
TGTCTGTCTA 27180 

TCTATCTATC TATCTATCTA TCTATCTATC TATCTATCTA TCTGAGACAG 
GATTTCACTA 27240 

TGTTAGCCTT GGCTGTCCTG GAACTCTATG TAGACAAGGC AGGTCTTAAA 
CTCGCAGAAG 27300 
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ATCCTGGTGG TCTCTCCCCA CTTTGCCTGA TTAGGCTCAC TTTTAAAGGG 
AATGAAATGG 27360 

GCTGGGTGTG GCAGTACACA CCTTGCTCTC AGCACTCCGA GGCACAGAAA 
GGCAGATCTC 27420 

TGAGTTTGGG GCCAGCCTGG TCTATGCAGT GAGCTATAGG CAAGCCAGGG 
CTACATGGTA 27480 

GGACCTTGTC TTAAAAAGAG CCCCAAACAA ATAGCTCACT TGCCCAGGTG 
AGGTCCACCA 27540 

GCATCTCTAC ATTTTGACCG GAAGCTAAGA GGAATCTTTA TTACATCACG 
CCTGCCACAG 27600 

TCTCCATCTT TGTTGCAGCT GGAGTGCTCC CACAGGGCTT CCACTGCACG 
CACTGCACCC 27660 

GAAGGGGCTT CCACTTCACG CACTTCACCC GAAGGGGCTT CCACTTCACG 
CACTTCACCC 27720 

GAAGGGGCTT ACACTTGATT CACTTGACCC GAAGGGGCTG ACACTGCTTG 
CACTGCACCT 27780 

TAAGGGGCTG ACACTGACCC AATGGCACCC GAAGGGGCTG ACACTGACCG 
CACTGCACCG 27840 

AAGGGGCTGA CATTGCACAC GCTGCACCCA AAGGGGCTGA CACTTGCTGC 
ACTGCACCCC 27900 

AAGGGGCTGA CACTTGCACG CACTGCACCT ACCAAGGGTG ACACTGCACC 
TGCTGCACCC 27960 

AAGGGGGCTG ACACTGCATG CACTGCACCT ACCGGGGCTG ATACTGCACC 
CACTGCACCC 28020 

AGGGGGGCTG ACACTGCACC CACTGCACCC AGGGGGGCTG ACATTGCACA 
TGCTGCACCC 28080 

AAAGGGGCTG ACACAGCACC CACTGCACCC GAGGGAGCTG ACACTGCACC 
CACTGCACCT 28140 



ACCGGGGCTG ACACTGCACC GCTTGTAATG TACATTACTG TTTTTTTTTT 
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TTCTTTTCTT 28200 

TTTTTCAGAG CTGAGGACCG AACCCAGGGC CTTGCCCTTG CTAGGCAAGT 
GCTCTACCGC 28260 

TGAGCTAAAT CCCCTACCCC TACATTACTG TTTAGAAACA AATTTATGGT 
CCTTCTCACA 28320 

TGCTGCAGGA GATTACACAA AGTTGGGGGT TATCAAGAAT GTGGATCACG 
GTGGATCATT 28380 

TTAGCACTGT CCCCCCCACA GAAAGGGTCA TTTCTAGACA GAAGAAAATA 
GTTTATATGG 28440 

AACACTTCTG GGCTGGGCAG TGGTAGCACA TGCCTTAAAT CCCAGCGCTT 
GGGAGGCAGA 28500 

AGCAGGCGGA AGCACGCGGA TGCACGCGGA CGCACGCATA TCTGTGAGGT 
TTAGGCCAAC 28560 

TCGGTCTATG CAGCAGCTTC CAAGACAGCC AAGGCTGTAT GGAGACCCTG 
TCTCGGGGTT 28620 

GGTGGGGAAT CTCTTCACCG TCTTGGTCAC TTCTTTATGT GTGAGACACA 
TAGACGTTTT 28680 

TCTTCTGAAT ATTTTATTGC TGCTTGTGGC ATTCACAACT TAGGGAAAAA 
TTGTTAAATG 28740 

CTGCATTCCC AGCACTTGAG CCAGTGAAGT TCAGGCCTCC GCTCGTCTTG 
TAATGGTATT 28800 

TGCACAGGGG ATGCCTTGGC TGAGTGAGTT CTTCCAGAAA ACTCCTGGGC 
CCTTAACACC 28860 

TATTTCCAGC ATTTGGAAAT CCGAGGCAGG AGGATTGACA TGAGTTGCAG 
ACATAGTCAG 28920 

CTAGAAGTGC AGCATTAAAT CCTATCTTAA AATAATTATT AGAATAATTT 
AGGGGGAAAA 28980 

GCCTCTAATA GAGATGGGAG AGTGTGCGCA TGACTGCCCT ACTGTGTGCT 
TCTAGAAATC 29040 
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AATATGAATG GGCCAGAACT AGAGAAAAGG CTGTGAGAGG CTGTACCCTA 
CTGTGTGCAA 29100 

CCCACTTCCC TCCTACTATG TGGGTGCTGG GCATGACACG AGGTTATCAG 
GCTGGGTGAC 29160 

AAGCACCCTT ACCTGTGGGC AGTCTTGCTG GTCCAACCTA TTTGCATTTG 
AATCCCAGCT 29220 

ACTTCAAACC CCATGGGTGC ATATTTACCC ACTTTTGGTT TTGGAAACAG 
GATCTTAAAA 29280 

TAAACAGGTC TCACTCTGTA ACCCATGCTG GCCTGAATTC AGCATCTTCA 
GCCTCAGTCT 29340 

CCCAAGCGCT ACGATTTCCT ATGTGCCATA TGTCACAATA CATGCACTTC 
AGTTTTGTCA 29400 

AAAGAAGTGA ACCAGGAATA ACTGGTACCT ACCTATAAGA CTGCTGTGAT 
GAAGGAGGAC 29460 

ATTGTGTAAA ACGAAACTCA GGATATAGTA AGTGCTCAAC ACGTGTTAGA 
CATGTTGGTC 29520 

TCCATGAGGG CACAAACCCA GGGCCTCATG CATGCCAAGA ATTGGCCCTA 
TCACTGAGCT 29580 

ATACAATTAG TCCCTATGAC CTACTGTGAC CTCAGACGCA CACCATGGAT 
CTGACATTGC 29640 

ATCAAATCAG AAATGAATTT CTGAAAGACT TGCTCATAGC ATGCCCTCCC 
ACACCCCCGT 29700 

CCCAGCCCCC CCTCTCACTG GCAAGGACAT CTCACTGTGG TGGTGGCAGG 
GCCTCTAAAA 29760 

CATCATAGGA TAGCTGAGCA GCAGTGGCAC ATGGCCTCTC AGTCCCAGCA 
CAGGGGAGGC 29820 

AGTCAGCCTG GTCTATAGTG TAAGCTCCGG GACAGCCAGG GCTACATAGA 
GAAACCCTGT 29880 

CAAACCTACC CTACTTAAAA ACAGAAGTAG AGCAGTAGTT TGGATTACAC 
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TGCTTTGACA 29940 

CTTGGTGGGT AGCATGTGTG CACCTGCCCA GGAGCTATCT GGATTCTCAA 
ATGGAAGACA 30000 

CAGACACAGA CACAAACACA AACACACACA CACACACACA CACACACACA 
CACACACACA 30060 

CACACACACA CACACCAGTT AACTTTTGAC ACGCCATGAC TAGCTCAAAG 
GCTAGGGACT 30120 

CCCAAACCTT CCCCTGTCAG CAAATGCTCC CCTCTGGTAC TCCTGAGACT 
AAGCTAAGCC 30180 

TTCCCCTGCT GTCCCAGGCC CAACGGAGGA AGTGAGCATG GTCACTTACC 
TGATTCTTTT 30240 

TTTTCTTTTT TTCGGAGCTG GGGACCAAAC CCAGGGCTTG CGCTTGCTAG 
GCAAGCGTTC 30300 

TACCATGAGC CAAATCCCCA ACCCCACATA TTCTGATTCT TACATGGCTG 
ATTGGCTTTC 30360 

TGTCCCTGCA GTTCTTACAT CCTGTCCTTC TTCCCTGAAT CATGAGGACC 
CTCTCCTCTC 30420 

TCTCTCTCTC TCTCTCTCTC TCTCTCTCTC TCTCTCTCTC TCTCTCCCTT 
CTCTCTGTGT 30480 

CTCTGTGTCT CTGTCTGTCT GTCTGTCACA CACACACACA CACACACACA 
CACACACACA 30540 

CACACTAGCC CATGCAAATC TAAGGGCCCC TTCCCGTCTC CCTTTGCCTG 
ACCATTGGCT 30600 

CCTGGCATCT TTATTGATCA ATCAAAAACC AATTGGGGAT AAGGACCTAC 
AGTGTTTGGA 30660 

CATGAAGATT CCTAATTTGG GGGCTGCATT AATTCAAAAC ATTGGAACCA 
ATTCCCAACA 30720 

ACAACAACAA CAACAAATAA AGCAAAGAAA AAAGTTTACA ACGCTGCCTT 
CATTATTTGA 30780 
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GAAACAAATG TAAGGAAAAC CATCAGGTAT CTGGACTTTT AAACGGCCGG 
GATTTAGAGA 30840 

CTCTGGGATG TTTTCTGTGT TGGGGATTGA AGCCAGGGCC CTGGGACATG 
GTAAGGAAGC 30900 

ACTGTACCAT GAAACTACAC CCCAGAGTCT GATAAGGCTA CTGAATGACA 
ATTAAAGATT 30960 

CATAATTGCT GAAATTCTGG AAAACTCTAA GCTACCAATT TTGTATATGC 
TCAACTTGGT 31020 

TTCCTGAAAA CATCTGAGGT TCTTGCACGT AACTTTTCCT CAGAGCAAGT 
ACAACTAAAT 31080 

TCTGACTTTG TGACAATAAA GATTGTCAGG AAAGGCTTTG TGAAAATGTT 
CAGTCCCCAG 31140 

GAGACGTGCC CTCCTGCAGC CTGTGAATGG CGGCCAGGTC ACAAGTCAGC 
AGATGCAGTG 31200 

GAACGGAGTG TGGTACTTCT GTGAGACACT GCAGGACTGG ATGGATGGCT 
TAGTAGTTAA 31260 

I GAACATGGGC TGCTCTCCCA GAGGACCTGG TTTCAATTCC AAGCCTTGGG 
CCCTGCAAGA 31320 

ACCTTATATA GACTGGCTTC AAGTTCTCCA TGTAGCTGAA TATGACATTA 
AACTCCAGAT 31380 

ACTCCTGAGT CCTAGGTTTA CAGCTGTGTA CAGCTATGTT TCTTCCCTGA 
CGACCCCGCA 31440 

GCCCCCATTT TGAGATAGGG ATTTAGGTAG CCCAGGCTGG CCTCACACTG 
ACTAAGTGAG 31500 

ACTGGCTTTA AACTCCTCAT CCTTTAAGGT ACCACCATGA ATTTGCTGTA 
TAGCTCTGGC 31560 

TGGCCTTATA TAATGTAGAC TAGACTGGCC TTTAACTTTA AAATTGTGCA 
CTTTATTTTT 31620 

TTTTAAATTA TGTATTTTAT GTATATGAGT ATACTGTAGC TGTCTTCAGA 




62 



CACAGGGCAC 31680 

CAGACCTCAT TACAGATGGT TGTGAGCCAC CATGTGGTTG CTGGGAGCTC 
AACTCAGGAC 31740 

GTCTGGAAGA GCAGTCAGTG TTCTTAACCT CTGAGCCATC TCTCCAGCTC 
TCTGCTTAAT 31800 

AAGTGCTGGG ATGACTAGCA TGTGTCACCC TCCTGGCCAC TTCTGGTGTC 
TCCTTTCCAG 31860 

GCTTTTAAAA ATTATCTGTT GGCATGTCCA CACAGGGTTA TATGCATATG 
AACGCAGGTG 31920 

CCTGTGGGCT GTCCTGTCCT GGAACTGGAC TTACAGATGG CTGTGAGCCA 
CTTGATGTGG 31980 

GTGCCTGGAA ACTAACTGGG GGTCTGAAAA AGCGGGAAGA ACTCACATGA 
CTGTGGAGTC 32040 

TGCTACCCCT TTTATTATAA AAGAAAAGAA GATATTTTAA CAGCACGTAT 
GAGACACAAG 32100 

TGAAAGCTGT GGCCATGGTC TTCAGGGATG GTTAGGTCCT GCAAAACTGA 
AGGAGGTGGG 32160 

CTCTGGGTGT TGGTCACATG GTAGATTGAT AGGCCCTGGG TTCAATCCCC 
ACCTCTGCAT 32220 

AAAGCAGGCA TGGTGGTTCA CTGCCTGCTT TTAGAGGAAG AGGCAAGAGG 
ATTGGTAGAA 32280 

TCTCAAGGTC ATTTTCAGCT ACATAGCACC AGTCAAATCT TTGAGTCCAA 
GACCAGCCTG 32340 

ATCTGTGTAC TGAGTTCCAG GGCAGCTACA TAGTTGAGAC CCTACTTAAA 
ATTTCAAACA 32400 

ACAAAACCCA CAAGGTTTAA AAACTCTATC ACTTTTAGTT ATGTTTGTGT 
GTAAGTGTTC 32460 

GTGCCCACGG AGGTTAGAGC ACTGTATCCC CCGGAGTGGT GAGCGGGCTG 
GCATGAGTGC 32520 
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TGAGGGCTGA ATTCAGCATC TGTGTTCAAC ATATTTGTTA AAGCACACGA 
AGAGGAAATG 32580 

GCCAGTGTCA ACAGGAGCCC AGCCAGGCTT GGGGTGGGAA AATGCTTTGA 
CTTCTATCTG 32640 

GCAAGAAAAA AACAATTCCA AGTTTGATCC TTGCCAGACT CTTTGGCCTT 
TACCAGGCTT 32700 

CCTCACAGAG TCTGCTGTAA CTGTTTCTGC AAATTCGCAG AGGAACCTGA 
GATCTCAGGG 32760 

CACGTTGGAT ACCCACGTGC TGGAGAAACT GAACAATGAC TTTAGGTTTC 
ATCGTGCCTG 32820 

GATGAAACAT GAAAATACCC CACACCGCTG AGCTGAC AAA TGTGCCTCTC 
TCTCTGTAGC 32880 

CCTTCAGTCA GCTGAGCAGT TTGCCCTCGC TCGGCTGCAG TACCAGCACA 
GGCACCCCAG 32940 

TCTCCCCGAT GAGCGGTCTC ATGAGGATTC AGGTTGGTCT CGAACTCCCT 
ATGTAGCTGA 33000 

AAATGACCTT GAGATTCTAC CAATCCTCTT GCCTCTTCCT CTAAAAGCAG 
GCAGTGAACC 33060 

ACCATGCCTG CTTTATGCAG AGGTGGGGAT TGAACCCAGG GCCTATCAAT 
CTACCAAGTG 33120 

AACAACACCC AGAGCCCACC TCCTTCAGTT TTGTAGGACC TAACCTAGCC 
CTGAAGGCCA 33180 

TGGCCATGGC TTTCCCCTCA GCACCCACTT ATCATGAAGG GGCAAGGGTC 
CAGTTTCTTG 33240 

GTTAAGTATC TACGCTTGTG ACTAGGGAGA TACATCCTGG GCAGGAGTGA 
AGGGTTACCC 33300 

ATTCAGCAGC AGAGTTCCTA GGTTTACTGT GACAACAAAG ATCTAGGAAT 
GGCCTAGGTT 33360 

GTCCTGACAT GATCCCATTA GCCTACCTCA GATATCTGAA TGCAGGGGCT 
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CACTGTGTGT 33420 

CCCAGTCAGG GACAGTATTT ACTACCCTAA AGTGGGTTAC AGCTCTCGGG 
GGGGGGGGGC 33480 

TGCGTGCAGG ACGACACCTG CACCTTCACA CTTGCTTCTT CAATGGAGTA 
AGAGGCTGCT 33540 

AACATCCCCA AGGTTTCCAT TTCAGCTAGG ATGAGAGTCT GGAGTTCATG 
TCCCTGGTAT 33600 

TCAAGTATAT GACACTGAAG AGCAAAGAGG CAGAGAGCTC ATCCACTAAC 
AGGCATGCAC 33660 

TGCACTCATG AACATCTGTG TCTGATCCCC GGTACACATC AAAGCCATGT 
GCACCGAATT 33720 

CCAGCGCCAG GCAAGCATAC GCAGATGCAT TCTCTGAGGC TAGCTGGCTG 
GGCAGTCTAA 33780 

ATACTGAGTG CCAGGTTCCA GTGAAAGGCC TCGTCTCAAA AACCAGAACA 
AACCAAATCA 33840 

AACCAAACGT AAAGCAGACC AAGGTGAGAG AGTCTTGAAA TGACACCCAA 
GGGTGCTGTC 33900 

TGGCCACTGC TTACACACAC TGAAGGACAG CAACTGACCG CAAGAAGCGG 
GTTTAGAGTG 33960 

GAGTCTACTG TCTGCTGGGT AGTCCAATGA CGCTGTGTCA GGGCAGGGTC 
CGGTTACAGA 34020 

AATCACTGAC GGGGAAGCCT TCCCAGGAGA AACGGGGCAC CCTTTTTGCT 
TTCTGGACCT 34080 

TGGACACACC TGACCCTACC CAGCAAAGCC CAGGATTGAG CAAAGCAGAT 
AACTAACTCC 34140 

TGGCTCAGTT AGGTGAACTG GCTTTTGGCT AATAACCTTA AGACCCAAAT 
AACTGGGACA 34200 

AATAAACTTA TTCTACAACA AGAAAAAGCA AGCCACATAA CAAAAGGCTT 
TGCTTTCCAA 34260 
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TAGTTTATTT ATAAAAGCAG GAAACATTGG GCTCACACTA TTCCAAGAAA 
CTCAGGAGAC 34320 

AGCTCTTGGC TTCTAGAGGG GACAGTCCAG TCTGATCTTC TGTGTGATAG 
GAACTTCCCA 34380 

GAGTTTAATG CTGTCCGGAG TTTGTATGTG GTGTCAGAAC AGGTACTATA 
CCTGGTGCCA 34440 

AATCTAGAAT GAATGGGGGC TGCTCTGGAC AAGGCTGTGC CACCCTCCTG 
GGATGCACCA 34500 

ATTTCTACCC CAATATCCAG TCCCTAGGCA AGCACTGTCC TTATTTGGGC 
CTGAGAGGCC 34560 

TCTGTTCTCA GTTCCTGACC ACCTGCCCTT CTTTGGTGAC CGCCCAGTTA 
TAACTCTTCG 34620 

GAGAGTCCAA GGCCTCTTCT ATCCGTGCCT CCAGGTTCTC CCGAGTGATG 
AAGTTTTTGG 34680 

CCTCCTCCTA CGGGAAGAGA AAGGTTAGCT ACGTGGGATC TCTCAGAGCT 
GGCTGCCTGC 34740 

ATAACTTACT GTCTCTAGCC CACCTTAAGA AGGGTGTCAA AAGAGCCCAG 
GGCCCCTTTG 34800 

AGTGTCCCAC CAATAGGCCA GCACAACTGT ACTAGGTGAC CTCAACCATT 
ATATCCCTCT 34860 

TCCTCGGACA ATTACCCATC AGATCTCTCA GCCACAACCC AAGAACGAAT 
CGAATCGTGA 34920 

CAACCTGATT GGGGGCGGGG GGAGGTGCGG GGTGGTGGAC TCTCTCAACC 
TTGTCACTCT 34980 

CCCTTACACA ATAAAAACAG CTCCATTCTC ATTTATGGTT TCCCTGACCT 
TCAAGTTTCA 35040 

CACGGCAATA CTAGCCTGTA ACCTTTCATT GACCATTGTC TGGGAGGCAC 
TCTGTGTGTG 35100 

CAGAGCAGAG AGAGAAGCCT CCTGAAGACT GGTCACTTGA CACCCTGTGC 
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TGCTCTCCCC 35160 

AGAGCCTCCC TCACCGGAAT CTCCAATACC CACATTCCTC ACGACCTCGG 
CCCACCTGCA 35220 

GTTTGAGAAC TTCTTGTTCT TTCAGTTGCA CCCAAGCCTG CTCCTCCTGG 
GCCCTCTGGG 35280 

CCTGGACCTC AGCCTGCCGC AGCTCCTGGG CCTGTGCTTC GAGCTGCAAC 
CTAGCTATCC 35340 

TGCGAGGAAA AAGGAGCCGA GAATGAGCGG GGCCGTGGGC CGCGCACGGG 
CAGAAGGAGC 35400 

AGGGTTGGGG GCTGGCCCCC GGGCCGCGCG GCCAGGCGGT TCAAAGCCCG 
CAACATGACA 35460 

GCTAGCGCCT GCGCCCTGTC CGGAAGTGGT CGAGGAGCAT CACGGGAATC 
TCCTGTCCCC 35520 

GCCCCCGTAG GCGGAGCTCT TGCTGTCTGG CCCATCGCGT CCCTGGAGTC 
TCGGGTAGAG 35580 

TGGGAGGGTC GAGGGGCACC TTGGAGACCA CAGTTCGCTG GGCTGTATTG 
CCCGCCAATG 35640 

ACAGCTACAA CACCAGGCGC TCTTGCCTAC TTGTTCACAA GCTTGGACAC 
CGCCCCAAAG 35700 

CGTGTGGTTC TCTGGCCCAC CGGTGGCCTG TGTACCCTTC CAGGGCTGAT 
GCAAGAAATT 35760 

CTGAGCTTCT AGCTACTTAA GTTAGGCTTA CACTTCCTCT TTGAGTAAGC 
GTTCTCCTGA 35820 

TGCACTTTAA ACTTTAGGTT TGCAATCGAG CAACAAACCT TTTCCTTGCA 
TTGCACTGCC 35880 

CTGTTGTGGT CTTGGGCAGC TCTGCAATTA ATTGCCAGAG TTCTCAGTAT 
CAGTGATCAC 35940 

TTGTTTTCTA CTGGGTGGCC TGTGTGAGGT GACGCTTGTC CTCTTTCACC 
TGAGCCGGGG 36000 
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TCTTCCCACC TTGAGACCAT TTATGAAGCT TTTAAAGTAC TATCTTCCTC 
TACAATGAAG 36060 , 

AGAAACAGGA GGCAGTGCAG GGCAGAGAGC ATCTGTCCCA AAGGTAGATG 
GCTCTCCTCT 36120 

GGGAGAGTTG TTGGTAATCA GAAGCTGACG GTGAGGGCTA GGGAACTGAA 
GTCTCATCCT 36180 

ACTCATTAGT GTTGTCAACA CCAGATTCTG CAGAGAATCT GCACTTAGGA 
GGGTCTGAGG 36240 

AGCCTGGGCT GACTGCAATA CCTGATATTT CTCAGAAGAG AACAGAGTCT 
GCTTACATCC 36300 

TTCTCCCAAC TTCCAGGCCC GTAGGAGGCC CAGCCAGCAC CCTCTTCCTC 
TCATCTCACC 36360 

CTACCCTGGA GTGACACAGG GTTGCTCTGA ACATCAGGGA ACGTGGCACT 
CCCATCCTTT 36420 

CTTGCAACAG GTCTCACTAT ATAGCTCCGG ATGGTCTGTC ACTTACAATG 
TATATCAGAC 36480 

TCACAAGAGG TCCATCTGCC ATTGCCTCCT AAATGCTGGG GTTAAAGGCA 
CATACCACCA 36540 

CACCTGTCCT AAACCTTTCT TCTTCGGGGT CATCCTAGAT AACCAGTATC 
TCATTTCAGA 36600 

TAACTTCAGT GTCTGGGCAA AGAGAATATT TCTATGGTGT GGGTCATTCC 
TAGAGGCTTC 36660 

CTAACCTTGC TGGCTCTGAC GTTCTCTCGG CTGGTCAGGT CTACTCATCC 
TTCTTTCAGA 36720 

GGGTTTCATA AGTTGTAAGA GATTTAGGCC TACGGTGGAT GAAAGATGTG 
GAGTCATTTT 36780 

GAGTAGCTAA TGCTACAGAA CTAGAAGGCA GGTTCTCTGC CCCCTTCTCT 
GACCTGTTGG 36840 

GGAAGTGGAA GTAACTTTCC ATTTGTGACC TTCCCCACTA GGTGGCGAGA 
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TAGAATTGTC 36900 

AAAGCTGGGA AAGGAGGCTT TTCTGGGCAG TTCATGGGTA GAAGGACAGA 
CAGACAGAGT 36960 

TGGAGGAATG GAAGCCTCCT CATTTACCAA GGGGTAAACT ATGGATGAGG 
TGACTTCAGG 37020 

TGCCTGCAGG ACCCTATGCA GACGGTCCCA GGATTTAATG ATCAGGCCAT 
TCTATTTCCT 37080 

CTGGTGTCAA ATCCAGTGAT ATCATTAAAA CAAAAACAAA AAAGCCCCAA 
TCAGGGTCTT 37140 

ACTTGATGGC CTTATATTTC CAACAAAGCC CAGGCTGGCC TTGAACTTGA 
AGCAATATCC 37200 

CTGCATCTGT CTCCAGAGTG CTAAGATTGT GTGTGTCACC ATACCAAGGT 
ACAGTGATCT 37260 

CTTGAAACAG GGAGGTGCAA GTCATTACTC AAACCCCTCC TCACAATGTT 
CTATGAGCAA 37320 

ATCCGAAGTT GATGTTGGCT TTTAAAGTCA CCAGACAAGT GTCCTTCTGC 
TTAGATCTTC 37380 

CTAGGAACTG AGGTTTGAAA CAAAAAGCAT AACATGGTTG GAGAGATGGC 
TCAGTAGTGA 37440 

AATTCTGAAT GTGGTTCCCA GCATCCACAT TGGGCACCTC AGAATGGCCT 
ATAACTTCAA 37500 

TTCTAGGGAC CAAGTAATCT CTTCTGGCTT ATGGGTGTCA CTCACATGCA 
TGTGTGCATA 37560 

TGGTGCCTAA GTAAAAAAAT AATCTTTTAA AAGCAGATTT TAAAAAAAAT 
TTCAACGATT 37620 

TTTTTTTAAT GTTCATTGGT GTTTTGCCTG CATGTATATC TGTGTGAGGG 
TGTCAGGTTT 37680 



CCAGGAACTG GAGTTACAGA CAGATGTGAG CTACCTGTTG GTGCTAGGAA 
TTGAACCCAG 37740 
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GTCCTCTGGA AGAATAATCA GTGCTCTTAC CCACTGAGCC ATCTCTCCAA 
CCCAAATACA 37800 

TCTTAAAAAA AATTAAAACA GTGGACCTGC CTTCTAGTTC TGTAGCATTA 
GCTACTCAAA 37860 

ATGACCCACA TCTTTCATCC ACCGTAGGCC TAAATCTCTT ACACTTATGA 
AACCCTCTGA 37920 

AAGAGGATGA GTAGACCTGA CCAGCCGAGA GACATCAGAG CCAGCAAGGT 
TAGGAAGCCT 37980 

CTAGGAATGA CCCACACCAT AGAAATATTC TCTTTGCCCA GACACTGAAG 
TTATCTGAAA 38040 

TGAGATACTG GTTATCTAGG ATGAACCCCG AGAGAAGAAA GGTTTAGGAC 
AGGTGTGGTG 38100 

GTATGTGCCT TTAACCCCAG CATTTAGGAG GCAATGGCAG ATGGACCTCT 
TGTGAGTCTG 38160 

ATATACATTG TAAAGGGGAG AACTCCCGGA ATTTGTTCTC TGACCTACAC 
ATGTGACATG 38220 

CATGTGTTCG TGCACACACA CATACACACA CACACACTGT AAAAATGCAA 
AATGGCTACC 38280 

AAGTGGTCAT TGAGCTTCTC AACCTCACTG ACAGCTACAT TATTATATAG 
ACTTACTGGG 38340 

AACAGATCCG CAGGAAATTA TTTGGAATCT TTTTCTTTTC TCTAACGGGG 
GCTGATCTGG 38400 

AACTTCTGAG CCTTTTTGTT CCCTATCATG AATGCTGGGA TGGCAGGCGT 
TTCCACATGA 38460 

CTCGTTCGAT GTAGTATTGC AGACTGAACG CAGGACTTTC CACACACTAA 
GCAGGCATTC 38520 

TGTGAACTGT TACGTCTCCA GCCCCATTTC TAAATTCTAA CACCAAAGTG 
CTAGTTTTGT 38580 

CCCTTGACCT GGACACTGCA GTGAGTTCAC AGAACTTATA ATCACCCTGT 
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TTAGTGTAGA 38640 

AGCTACCTCA ATCACCATGA CATTTTTCAA AAATGTGTTC ACTTTCCTCT 
TTAGAGTCCA 38700 

AGCACACCAA GCTTGGCGGA ACAATGATAC AGTCTAACTG GATCTGTTTC 
AAAATTGCAA 38760 

CTTGACTCTA CATCTAAATA GGTATGTGTT GTGACAAGTT TATTATGTTG 
TGTGTGTGTG 38820 

TACACATGTG CCACAGGAAG CCAAAGGACA ACTTGCTAGA GTGCATTTTC 
TTTCCTGGGA 38880 

ATTGGCCTCT GGTTGTCAGG CTTGGTAGCA CGCACTTTGA CCCTCTAAGC 
CATCTTGATG 38940 

GCCCAGAGAG TGAACCACGC TGTTTTCACT TTCCTACTTC TTGGGCTGAA 
TTCTCAAGTA 39000 

CCTGCCCTTG CAGCTTTGCA CCCTTCCTAA CTTCAAAAGG AAACTGACAT 
GGAGAAGGGT 39060 

GATACTTGAG GATTTCCTGG CTCACTTAGC TCAGGACTCT GGCCTAAGAA 
CAGGGAACCC 39120 

AGCAGTGTGA ACAGGGGTCC AAGAGAGTTC ATTTGTACTT ACCGGCAAAA 
CAGTGTGGCA 39180 

GGCTTCACAC AAATACATAC TCGGCACCAG GACAGGGCCA CTCTGGATGG 
AGGTGGGCTT 39240 

AGGTGGGGTA CTGCCCACCC AGGGTTGTCC TCTCTTGTAA GCAGACTCAT 
GGGGACAGCC 39300 

CAGAAGTGAT CCCACAGTCT CTCTGAAGCT GACAATAGGG GATAATTCTA 
AGTCCTCATC 39360 

CTGTGCTCAT CCACAGTCCT TTGTCGATCT GGACACTACT ATCATGGGCT 
GCTGGAAACA 39420 

GGTCTTTGCA GCCCAAGTCT GAGCCACTAG CTCTGCTTTC ACTGCCAGCC 
ATTAACCTCC 39480 
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GGGAGTGGGC GTGGGATAAG AAGAAACATT TATAGAGTCA ACGGCCAATC 
TGTATTTGGG 39540 

CTGAAAACCA TATTAAGGAA GGGCCAAGCC TGGCATAATG GTGACCAGAG 
CCACTAGGGG 39600 

ACCAACTGCA CCCAGCTTTA GCAAAGTGAC AGGCAGCATG AGGTACCATT 
ATGTGTGCTG 39660 

GGCATGCGGC TTCAGGATGG CTCTGTGACC TCCTAGAGGT TGTCTTATTG 
GCAGGCATAG 39720 

GAAACAAAGG CAGAGAATGA ATGCTACAGC CAGAGAGACC CAGATCTGCT 
AAGTGGATGA 39780 

CTCTTGTACA TATGTGTGTA TGTTGTTTTT GAGGCAGGGT CTCACTGTGT 
AGCTCTGACT 39840 

GTCCTGGAAT TGGATCTGTT GGTCTCAAGT TCAGATCCTA GTGGTTTATT 
TTTCCTGTGT 39900 

ATGTGTGCTT GTCATGCACA AGCATGTGTT AAGGTGAGTA GATATGTAGG 
CACATGGAGA 39960 

TCAGAACAAT GGTGTCACTC CAAACCTTTA TAGACCTATA TCCATCTTGA 
CATTAGGGTT 40020 

ACAGGTGTGT TCAACATAGA TATGGCCAAA ATTTAATGTG GGTTCTGAAG 
ATCTAAATAT 40080 

GTCTTGTGCT GGCTAGTTCT ACGTCAACCT GACACAAGCT AGAGTTATCT 
GAAGGAAGGG 40140 

AACCTTAGTA GAGAAACTGT CTCCATGAGA TCCAGCTGTA TAGCATTTTC 
TTAATTCTTA 40200 

GTTAGAGACT AATGGGGGAG GGCCCAGTCC ATTGTGGGTG ATGCAACCTT 
AGACAGGTGA 40260 

ACCTGGGTTT TGTAAGAAAG CAGGCTGAGC AAGCCATGAG GAAGCAAGCC 
AGTAAGCAGC 40320 

ACTGACCATG GCCTCTGCAT CAGCTCCTGC CTCCAGGTTC CTGCCCTGTT 
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TGAGTTCTTG 40380 

TCCTGACCTC CTTCCGTGAT GAACAGTGAT ATGGAAGTAT AACCAAATAA 
ACCCTCTCCT 40440 

CCCAAGTTGC TTTGGTCATG GTGTTTCATC ACAGCAACAG AAAGCCCAAC 
TGAGGCAGGT 40500 

TTTCATGCTG TATAACAAGC TGAACCATCT TACCAGCTCC ATAGTGTTTA 
TTTTAAAAGA 40560 

TGAGTGTGTA ACTTTCCTTT TTTTCCTTTT AAAAATCCAA AGAACCACGT 
TCCTCAGGAA 40620 

AAGCTCTGGG CCAGTTCTCC TGGTAACTTT GAAGTCTTTT TAAAGGCAGA 
GTCTATGTTA 40680 

GACAAGCTGG CCTCAACCTC ACAGAGATCA CCTCCCTCTG CCCGTTAACT 
GCCTGGTGAG 40740 

CTACAATGTG TTTTTAAAGA TGTCCCTGTT CCCTCTTAAA CAACTCCAAT 
TTCACCCATG 40800 

TGTTCCCATT TGGTAGGACA GGAAGCCATT TGTTCATCAT GAAGCTTCTG 
CTGATGTCAG 40860 

GACAGGCGCG CGCGCGCACA CACACACACA CACACACAGC AGCTTTAGTC 
ATTTGTGGTC 40920 

AGCTGGGAAA ATGGGAAAAC ACGGTTGGAG CTGAGTTGAA CTGAAGAGTT 
GGTGGAGACA 40980 

CATGGTGCAA ATCCTGAGCA GTAGCTGAAG GAAAGGTACA AGTTTGGCAG 
TAGATTGGCC 41040 

AATGAGGTGC AGAGATAAAG CAGAAGGGCT GCCCCGAGAG CTGCAGCATG 
GTGCGTGGAA 41100 

CCCTTCAGGA GGTAGAAAGG TAGAAAGGGC TGCTTGGACT ACTAGTGTGT 
AGATTACTGT 41160 

CTTTCAGCAG GTGAAAGACA AGGCTAGAGC CTGTGATTGG ACAGTAGAAA 
AGGAGGGCGG 41220 
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GCTGAGAGTT TGAGAGTCTG GAGGGATAGG AGGAAAGAAG GAAGATGGAG 
GAAGAGAAGG 41280 

ATGACCCAGA GCTGTGTGGC TTTAAATAGC CACAGGTAGC TATGAATATC 
ATATAAGGGG 41340 

TGGATTATGA CAGGACAATT TGTCCACTCA AGGTGGGCAG CTTATATCAT 
ATTAATTGGC 41400 

TCTGAGTTCT TTGTCTTGGG CATTTTGTGA GCTGAGAATT TACTGATATA 
AATCTGACTG 41460 

ATAAATTACA AGCCTCTAGA GTTTTGATTT TACTGGGTTA CAGGGATTTG 
TGACAGTTAA 41520 

CTGCGAGATG CTACAGCCAG AGAGACACGG ATTCTGCTAA GTAGATGACT 
CTTGTACATA 41580 

TGTGTGTATG GGGGCTAGCT GTGAAGGCAG TGAAACTGCT GCCAGGGCCA 
GAGAGTAGTT 41640 

GGCACTACTG TGGGATGGTG CATCCATTTT TTTAAAAATT ATTTAATGCA 
ACACTAGTGA 41700 

GTCATCCAGT AGGAAATGCT GGGGTCTGGG GAGCTGGGGG TGGAGGAAAG 
CCACAAGCCC 41760 

ACGGAGCCCC AGATCCCCCA CCTCTTTGGA GAATAACACT GATATCAGTG 
ACTCAGACAC 41820 

AATAGATCTT GGGGTTCAGC ACCCAAGCTC CTCTAGTAAG CATGGGTGCA 
AAAGGTGTGG 41880 

AATGGAGAGT GAAGGAAGAC TTTTTCATAA GCCTGTCACA AATGAGGAGG 
AAGCTAAGCT 41940 

TGGGAAATGC AGGCCTTCAG TGGCAGACCA AGTGGAGTCA ATGAAGTAAG 
GTCTGAGTAG 42000 

AAGGGCTCTG GGTGTGCGCT TCAGGCTGGG TGCACACTTC TTTCTGAGGA 
AATGCTCACT 42060 

TCCACTTTGA CCATTCCCTG ACCCAGGTCA TAGCTGATGT GCCAGAGTGT 
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CATGGGTGAA 42120 

GTGGTCACTT TCGCTCTTTC CACACAACTG TGCTGTGTAA GACACCCTTT 
CTCTGGTCAT 42180 

ACAGGAGTCC CCTGTGGGGT TTGAGCCCTG ACTTAAAAAG AAAGGATGAG 
GGCTACTTCT 42240 

GTGGAAGGGA GCAAAGAGCA GAGGTCATTC CTGCTGGAGG AGATCTGCTA 
ACAAGCATGT 42300 

GATGTTTAAC ATTAAGGGCT GCTCATCAAG TCAGCACTGA CTCCAGCAGA 
GTCCTGTCGA 42360 

GGCTACTCCA GTATGCCCTG GTCAAGACTA GCCTTGGCAA GGGAGCAGCC 
TGGGCTGTTG 42420 

CTAGGTGGAT AGAAGC ACAC ACAGAGGATT TTCTTAGTGG TGTATGTAAT 
CACTGAGGTC 42480 

TTGCTGACCC AGTAGGCATA CTCCTCCATT GCTAGACTCA GTCACACAAA 
GTGTACAAGA 42540 

ACAGGGCATT CTTCATGGAA AATTCCTGAC TGGGTCTTTT AGAGCTCCAG 
TTCCTAGAGG 42600 

GGCAGATGAT CCCAGTGACT TATGCTCAGT GTAAAGCTGG TCTGCTGTCA 
CATCTTTGCT 42660 

CCCAAAGGTT TCTGGGATTC CTCCTGTACT TTCTTCTATT TTTATTTTCA 
AGACAGGGCT 42720 

TCTCTGTGTA GTCCTGGCTG CTCTGGAACT TGCTCTGCAG ACCAGGTTGG 
CCTCAAATTT 42780 

ATAGAGATCC ACTTGCCTCT GCCGTCCAAG TACAGGGATT AAAGTTGCAT 
GCCACCACTG 42840 

CCCAGCCTCT CTAAAATTTT CTTAATTAAT TTATTTTTCA AGACAGAGTC 
TCACTATGTA 42900 

GTCCTGGATA TGCTGGAACT CAGTAATGTA GAACAGGCTG TCCTTGAACA 
TACAGAGTTC 42960 
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CACCAACCTC TGCTTCCAAG TGCTGGGATT GAAGTGTGTG CCACTATGCC 
CAGCTAAAAC 43020 

CTGTTTTATT TTCTGTGCAT GGGTGTTTGC CTGCATGTAT GTCTGTGCAT 
CATTTGCCTG 43080 

ACTGGTGCCC ACGGAAGTCA GAGGAGGACA CTGGATCCCC TGAGGTGCCC 
ACGGAAGTCA 43140 

GAGGAGGACA CCGGATCCCC TGCAGTGCCC ATGGAAGTCA GAGGAGGACA 
CCGGATCTCC 43200 

TGGATCTGGA TGACTGAGCC ATCACATGGG TCTTGGGAAA AGATCCCGGG 
TTTGCTCTAA 43260 

GAACAAGTGC TCTTAATGAT TGAGATGTCT CTCTATCCCA TGTTTCTTTG 
TACACAAACA 43320 

CCATGGACAC GTGGCATACA CTGGGCTTCC TTTTCACACC ACTCTGTCGA 
ACTTAAATTC 43380 

TGCTGGCGGC TCCAACTGAC CTTTCCTTTC TATTCCTAAA TTCTCGGCAT 
GGCTTGGGTC 43440 

TGGTTAAGTC CCCCCTTTTC CAAGCAGCCG GAAGCACTTA TCTCTGAATG 
TGCCTCTGTG 43500 

GGACACACCG GGGGACCTGC TGAAGCCTCT GAAGAGCAGA GGTGATGTCT 
GCCTCCCCAT 43560 

CTTTGCCCTC TTGTGCTAAG AAGCTACTTG TGATGCTGGA GGTGGTGGGG 
AAAACCCACC 43620 

AGCCTTGCCA CCTGAAGTGA AGGGCAGCCA CGGCCTGTGT CCTAGCCAGT 
GGGGATTAGT 43680 

GAAAATGGTA AAGTGGGCAA CGAGGCTGCT TGCTTTCTGA GCTTCCTCCT 
ATTTTGGGTT 43740 

GGTAGCAGCA GCGGCCCAGT TCCTTCCCAC TGTGGGGATG AGGAGTACGC 
CCTCAGGATG 43800 

CCGGCATCAG AGAAGGCAAG AACAGACGCA GTGTCGCACG TCTTCAATTA 
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CAGCACTTGG 43860 

GAGGCAGAGA CAGGCAGATC TCTGCGAGTT CAAGGCCAGT CTGGTCTACA 
CAGTGAGTTC 43920 

TAGGTTCGTC TGTGTTACAC AGGGAGAACT GTCTGAAGAA ACAAACAAAG 
AGAAAATTAA 43980 

AGTTAGATGT AGTGGCACGG TCATAATCTA AAATGTGGCC TAGCTGTTCT 
CTGTTCTCTG 44040 

TTTCTCTTTC TTCCTCCCTC CCTCTCTCTT CTTCATTGTC TGTCTGTCTG 
GTGCTTGTAT 44100 

ATCAAAATGT AAGTTCTAAG ATATGCTTCA GCACCGTGCC TGCCTGCCTG 
CCGCCATGCT 44160 

CCACCATGAT AGTCATAGAC CCACCCTCTC GAACTGTGAA TCCCAAATTT 
ACTTTCTTCT 44220 

ATGAGTTGCC CTGGTTATGG TGCCTTATCA CAGCAACAGA GCAGTGAGTA 
ATATACCCAC 44280 

CCTCAAAGAC AAGCTGAAAG AGAGACCCAT GTGCTGTGGC ATGCGTGTGC 
CTACACTTAA 44340 

CACACATAAA TAAATACATC TCCTGAAGAA AATTTAAAAG TTATTCTGGA 
CAGAAACTAG 44400 

AGAGGCCAGA CTGGCCTCAG CTCAAGCCCA CAGCAGCTCC TCTGTCCTGC 
TGTCCTTTCC 44460 

TGTAGAGAAA TTCAGTGAGA CCCAAGCTGT CTGTCCTAGG GCTATAAGCT 
GGGTGGGTGG 44520 

CTGGGATGAC CACACTTGAT AGAAAAGAGG AAAAGGAACT GGGAGTTGCG 
GCCGCC 44576 

(2) INFORMATION FOR SEQ ID NO: 1 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 



# 

77 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - "Synthetic Primer" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
GGACAGCCCG AAGGACTACA GGT 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic Primer" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

CGAAGAACTC CGCAGGGTCC 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic Primer" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
AAGACCCGCC ACGACCCG 
(2) INFORMATION FOR SEQ ID NO: 21 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic Primer" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
GAATCAGCAC CCTCTCCGCC 
(2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic Primer" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
TGCGGAGTTC TTCGTGCTGA TGGAG 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic Primer" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
GGTGCTCGGC GGCGTCCTTC 
(2) INFORMATION FOR SEQ ID NO: 24: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic Primer" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GAGTGGCGGA GAGGGTGCTG A 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic Primer" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
GGCCGAGGCT GAGCGGGG 
(2) INFORMATION FOR SEQ ID NO: 26: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic Primer" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
CTGAAGGACG CCGCCGAGCA 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic Primer" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
CTCCAACGCC TGCCGCTGC 
(2) INFORMATION FOR SEQ ID NO: 28: 




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic Primer" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
GCAGGAGGAG CGGGAGCAGG A 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "Synthetic Primer" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
TCCAGTGCCC CGCAAGCCG 



